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Abstract 

Interference alignment is a key technique for communication scenarios with multiple interfering links. In 
several such scenarios, interference alignment was used to characterize the degrees-of-freedom of the channel. 
However, these degrees-of-freedom capacity approximations are often too weak to make accurate predictions about 
' the behavior of channel capacity at finite signal-to-noise ratios (SNRs). The aim of this paper is to significantly 

strengthen these results by showing that interference alignment can be used to characterize capacity to within a 
O [ constant gap. We focus on real time-invariant frequency-flat X-channels. The only known solutions achieving the 

degrees-of-freedom of this channel are either based on real interference alignment or on layer-selection schemes. 
Neither of these solutions seems sufficient for a constant-gap capacity approximation. 
' In this paper, we propose a new communication scheme and show that it achieves the capacity of the Gaussian 

i X-channel to within a constant gap. To aid in this process, we develop a novel deterministic channel model. This 

deterministic model depends on the ilog(SNR) most-significant bits of the channel coefficients rather than only 
the single most-significant bit used in conventional deterministic models. The proposed deterministic model admits 
a wider range of achievable schemes that translate to a solution for the Gaussian channel. For this deterministic 
(713 I model, we find an approximately optimal communication scheme. We then translate the solution for the deterministic 

O . channel to the original Gaussian X-channel and show that it achieves capacity to within a constant gap. 

> ■ I. INTRODUCTION 

, Interference alignment has been shown to be a key technique to achieve optimal degrees-of-freedom 
(capacity pre-log factor) in several common wireless network configurations such as X-Channels [IJ, inter- 
ference channels interfering multiple-access and broadcast channels [4J, multi-user systems with 
^ delayed feedback Q-Q, and distributed computation [81, among others. The main idea of interference 
I alignment is to force all interfering signals at the receivers to be aligned, thereby maximizing the number 
^ of interference-free signaling dimensions. 
> 

^ _ A. Background 

I Alignment approaches can be divided into two broad categories. 

1) Vector-space alignment [T^, In this approach, conventional communication dimensions, such as 
time, frequency, and transmit/receive antennas, are used to align interference. At the transmitters, 
precoding matrices are designed over multiple time/frequency slots or transmit antennas such that 
interference at the receivers is aligned in a small subspace. At the receivers, zero-forcing is used to 
remove this subspace containing the interference. If the channel coefficients have enough variation 
across the utilized time/frequency slots or antennas, then such precoding matrices can be found. 

2) Signal-scale alignment l\3J, /i9|/: If the transmitters and receivers have only a single antenna and the 
channel coefficients are time invariant and frequency flat, the vector- space alignment method fails. 
Instead, one can make use of another resource, namely is the signal scale. Using lattice codes, the 
transmitted and received signals are split into several superimposed layers. The transmitted signals 
are chosen such that all interfering signals are observed within the same layers at the receivers. 
Thus, alignment is now achieved in signal scale. 
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Signal-scale interference alignment can be further subdivided into two different, and seemingly com- 
pletely unrelated, approaches: alignment schemes motivated by signal-strength deterministic models BH, 
[fTOl and real interference alignment 

For the deterministic approach, the channel is first approximated by a deterministic noise-free channel. 
In this deterministic model, all channel inputs and outputs are binary vectors, representing the binary 
expansion of the real valued signals in the Gaussian case. The actions of the channel are modeled by 
shifting these vectors up or down, depending on the most-significant bit of the channel gains, and by 
bitwise addition over the binary field of interfering vectors. The signal layers are represented by the 
different bits in the binary expansion of the signals. In the second step, the signaling schemes and the 
outer bounds developed for this simpler deterministic model are used to guide the design of efficient 
signaling schemes for the original Gaussian problem. 

This deterministic approach has proved instrumental in approximating the capacity of several chal- 
lenging multi-user communication networks. Using this approach, constant-gap capacity approximations 
were derived for example for single-multicast relay networks [fTTI . two-user interference channels with 
feedback 11121 or with transmit/receive cooperation IfTJI , Ifl4l . and lossy distributed source coding [ITSll . In 
all these communication scenarios, interference alignment is not required. For communication scenarios in 
which interference alignment is required, the deterministic approach has been less helpful. In fact, it has 
only been successfully used to obtain constant-gap capacity approximations for the fairly restrictive many- 
to-one interference channel, in which only one of the receivers experiences interference while all others 
are interference free [9|. Even for the X-channel, the simplest Gaussian network in which interference 
alignment is required, only weaker (generalized) degrees-of-freedom capacity approximations were derived 
using the deterministic approach [lOJ. The resulting communication scheme for the Gaussian X-channel 
in [fTOl is quite complicated and can not be used to derive a constant-gap capacity approximation. 

For the real interference alignment approach, each transmitter modulates its signal using an integer 
lattice transmit constellation. The transmit constellations are then scaled by a constant to ensure that at 
each receiver all interfering lattice constellations coincide, while desired lattice constellations are disjoint. 
Each receiver recovers the desired signals using a minimum-distance decoder. A number-theoretic result 
concerning the approximability of real numbers by rationals, called Groshev's theorem, is used to analyze 
the minimum constellation distance at the receivers. For almost all channel gains, this scheme is shown 
to achieve the full degrees-of-freedom of the Gaussian X-channel and the Gaussian interference channel 
[[3l. While this scheme is asymptotically optimal for almost all channel gains, there are infinitely many 
channel gains for which the scheme fails, for example when the channel gains are rational. Moreover, 
this approach can again not be used to derive stronger constant-gap capacity approximations. 

At first glance, real interference alignment appears to rely on the irrationally of the channel coefficients, 
preventing the desired integer input signals from mixing with the undesired integer interference signals. 
This raises the concern that the scheme might be severely affected by the presence of measurement 
errors or quantization of the channel coefficients. In addition, arbitrarily close to any irrational channel 
realization is a rational channel realization. How are we then to engineer a communication device based 
on this scheme? Quoting from Slepian's 1974 Shannon Lecture [16|: "Most of us would treat with great 
suspicion a model that predicts stable flight for an airplane if some parameter is irrational but predicts 
disaster if that parameter is a nearby rational number. Few of us would board a plane designed from such 
a model." 

Some of these concerns follow from the fact that real interference alignment is somehow isolated from 
other known signaling schemes and only poorly understood. Unlike the vector-space and the deterministic 
approaches, no vector space interpretation is known for real interference alignment, making it harder to 
obtain intuition. On the other hand, it is known that the degrees-of-freedom of the interference channel 
itself is an everywhere discontinuous function of the channel realizations [17|. It should therefore not be 
surprising that the rates achieved by real interference alignment share this characteristic. Rather, it appears 
that the degrees-of-freedom capacity approximation itself is too weak to make accurate predictions about 
the behavior of channel capacity at finite SNRs, and that the everywhere discontinuity of the degrees-of- 
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freedom in the channel coefficients are mainly caused by taking a limit as SNR approaches infinity. Thus, 
a stronger capacity approximation is needed. 

B. Summary of Results 

In this paper, we derive such a stronger capacity approximation for the Gaussian X-channel by char- 
acterizing its capacity to within a constant gap independent of the SNR and the channel gains, up to an 
(easily computable) "outage" set of channel realizations of arbitrarily small measure. In order to obtain this 
approximation result, we develop a novel deterministic channel model. In this deterministic model, each 
channel gain is modeled by a lower-triangular binary Toeplitz matrix. The entries in this matrix consist of 
the first ^ log(SNR) bits in the binary expansion of the channel gain in the corresponding Gaussian model. 
This contrasts with the traditional deterministic model, which is based only on the single most significant 
nonzero bit in the binary expansion of the Gaussian channel gain. The proposed deterministic model 
is rich enough to explain the real interference alignment approach. Thus, it unites the so far disparate 
deterministic and real interference alignment approaches mentioned above. Moreover, as our proposed 
deterministic model is based on a vector space, it enables an intuitive vector space interpretation of real 
interference alignment. This deterministic model allows us to design a fairly simple signaling scheme that 
achieves capacity of the deterministic channel up to a constant gap, as long as the binary channel matrices 
satisfy certain rank conditions. 

The solution for the proposed deterministic model is simple enough to be mimicked for the original 
Gaussian channel. We show that this scheme achieves the capacity of the Gaussian X-channel to within 
a constant gap. To prove achievability for the Gaussian case, we extend Groshev's theorem to handle 
finite SNRs as well as channel gains of different magnitudes. To derive the constant-gap approximation of 
capacity, we introduced a new outer bound on the sum capacity of the Gaussian X-channel. In addition, 
we show that, similarly to the Gaussian point-to-point channel, capacity is not sensitive to quantization 
and measurement errors in the channel gains smaller than SNR^^^^. 

One implication of these result is that the complicated solution achieving the degrees-of-freedom of 
the Gaussian X-channel in [10] is a result of oversimplification in the signal-strength deterministic model 
rather than the properties of the original Gaussian channel itself. Moreover, the results in this paper imply 
that the discontinuity of the degrees-of-freedom of the Gaussian X-channel with respect to the channel 
coefficients is due to the large SNR limit and is not present at finite SNRs. 

C. Organization 

The remainder of this paper is organized as follows. Section |Il] introduces the new deterministic channel 
model. Section Hill formalizes the Gaussian network model and the problem statement. Section HVl presents 
the main results of the paper — Sections |V] and |VI] contain the corresponding proofs. Section IVIIl contains 
the mathematical foundations for the analysis of the decoding algorithms. Section IVIIII concludes the 
paper. 

II. Deterministic Channel Models 

Developing capacity achieving communication schemes for multi-user communication networks is often 
challenging. Indeed, even for the relatively simple two-user interference channel, finding capacity is a 
long-standing open problem. For the Gaussian network, the difficulty is due to the interaction between the 
various components of these networks, such as broadcast, multiple access, and additive noise. For example, 
the two-user interference channel mentioned before has two broadcast links, two multiple-access links, 
and two additive noise components. 

Surprisingly, some results in the field of network information theory suggest that if the noise components 
are eliminated, so that the channel output at the receivers becomes a deterministic function of the channel 
inputs at the transmitters, then the problem of characterizing capacity can be substantially simplified fillil . 
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IfTSl . Such networks are called deterministic networks. This observation motivates the investigation of 
noisy networks by approximating them with deterministic networks [fTTTl . |fT9l . Il20| . 

This approximation has two potential advantages. First, the capacity of the deterministic network may 
directly approximate the capacity of the original Gaussian network. Second and more important, the 
deterministic model may reveal the essential ingredients of an efficient signaling scheme for the noisy 
network. In other words, the capacity achieving signaling scheme for the deterministic network may be used 
as a road map to design signaling schemes for the Gaussian network. If the deterministic approximation is 
well chosen, then the resulting signaling scheme for the Gaussian network is close to capacity achieving. 

The first critical step in this approach is thus to find an appropriate deterministic channel approximating 
the Gaussian one. This deterministic channel model should satisfy two criteria. 

1) Simplicity: The deterministic model should be simple to deal with. Obviously, we do not want to 
transform a difficult Gaussian problem into an equally difficult deterministic one. 

2) Richness: The deterministic model should keep the main features of the original problem. If the 
problem is oversimplified, then the solution for the deterministic model may not lead to a good 
solution for the Gaussian problem. 

These two requirements are conflicting. Indeed, oversimplification of the Gaussian model can sacrifice 
the richness of the deterministic model. Conversely, keeping too many of the features of the Gaussian 
model can result in a deterministic model that is rich but too difficult to analyze. Striking the right balance 
between these two requirements is the key to developing a useful deterministic network approximation. 

One of the approaches that achieves this goal is the signal- strength deterministic model proposed 
by Avestimehr et al. [11 J. We review this deterministic model in Section III-AI We introduce our new 
lower-triangular deterministic model in Section ITl-B[ Section ITl-CI compares the two deterministic models, 
explaining the shortcomings of the former and the need for the latter. 

A. Signal-Strength Deterministic Model lUTj 

We start with the real point-to-point Gaussian channel 



Here, n is a positive integer, and h G [1,2). Observe that all channel gains (and hence SNRs) greater 
than or equal to one can be expressed in the form 2"/i for n and h satisfying these conditions. If the 
magnitude of the channel gains is less than one, then capacity is less than one bit per channel use and 
hence not relevant for capacity approximation up to a constant gap. Moreover, since capacity is only a 
function of the magnitude of the channel gains, negative channel gains are not relevant here. Hence, ^ 
is essentially the general case. 

To develop the deterministic model and for simplicity, we assume that x[t] and z[t] are positive and 
upper bounded by one. We can then write x and z in terms of their binary expansion as 



y[t] ^2''hx[t] + z[t], 

with additive white Gaussian noise z[t] ~ A/'(0, 1) and unit average power constraint 



(1) 




t=i 
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The Gaussian point-to-point channel © can then be approximated as 



y= E[^]^-2" 

j=-oo 

^ 2"x + 2 



5^[x],2'^-^- + 5^([xW + [4.)2- 



■3 



j=i i=i 

n 



3=1 



or, more succinctly, 



_„ ^ for 1 < j < n, 



see Fig. 1(a) 



The approximation in this derivation is to ignore the impact of /i G [1, 2), the noise, as well as all bits 
[a;]„+i, [a:;]„+2, . . . with exponent less than zero. These bits with exponent less than zero are approximated 
as being completely corrupted by noise, whereas the bits with higher exponent are approximated as being 
received noise free. Therefore, we can approximate the Gaussian channel with a deterministic channel 
consisting of n parallel error-free links from the transmitter to the receiver, each carrying one bit per 
channel use. 

Having reviewed the signal- strength model for the point-to-point case, we now turn to the Gaussian 
multiple-access channel. In addition to the noise components in the point-to-point model, this channel 
model also incorporates signal interaction. Consider the Gaussian multiple-access channel 

y[t] ^ 2"/iiXi[t] + 2"/i2X2[t] +z[tl (3) 

where z[t\ ~ A/'(0, 1) is additive white Gaussian noise. As before, we impose a unit average transmit power 
constraint on xi[t\ and X2[t\- Moreover, n is a nonnegative integer, and /ii, /i2 G [1,2). The signal- strength 
deterministic model corresponding to the Gaussian channel ([3]) is 

[y]j-n ~ © [x2]j, for 1 < j < n, (4) 

where © denotes addition over Z2, i.e., modulo two. 

We note that in this model (i) the contribution of h is entirely ignored; (ii) real addition is replaced with 
bit-wise modulo two addition; (iii) noise is eliminated. As mentioned earlier, this simple model has been 
used to characterize the capacity region of several challenging problems in network information theory 
to within a constant gap. However, it falls short for some other settings. For example, for certain relay 
networks with specific channel parameters, this model incorrectly predicts capacity zero. Similarly, for 
interference channels with more than two users and for X-channels, this model fails to predict the correct 
behavior for the Gaussian case. 



B. Lower-Triangular Deterministic Model 

The signal- strength deterministic model, recalled in the last section, ignores the contribution of h E [1, 2) 
in the Gaussian point-to-point channel Indeed, h is approximated by 1. In this section, we introduce 
a new deterministic channel model, termed lower-triangular deterministic model, in which the effect of 
h is preserved. As we will see later, the new deterministic model admits a wider range of solutions — a 
fact that will be critical for the approximation of Gaussian networks with multiple interfering signals. 

Consider again the Gaussian point-to-point channel Write the channel parameter he [1, 2) in terms 
of its binary expansion 

00 

h = J2[h]3^-' = [h]o.mhUhh.... 

j=0 
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X y X y 

(a) Signal-strength deterministic model (b) Lower-triangular deterministic model 



Fig. 1. Comparison of the signal-strength deterministic model 11 11 , and the lower-triangular deterministic model proposed in this paper. 
In the figure, solid lines depict noiseless binary links of capacity one bit per second. Dashed lines depict noiseless links of either capacity 
one or zero bits per second (depending on whether the corresponding entry in the channel matrix H is one or zero). Links with the same 
color/shade have the same capacity. 



Observe that [h]o = 1, due to the assumption that h G [1, 2). Then, from ([T]), and we have 



y= J2iy]j^~' 

j=-oo 

OD CO OO 

j=0 1=1 j=l 

n j OO j+n 

j=i i=i 

n j 



j=i i=i 



j=i i=i 



so that 



[y]j-n ~ E for 1 < J < ^• 



i=l 

The approximation here is to ignore the noise as well as all bits in the convolution of [/i]o-[^]i[^]2 • • • and 
0.[a;]i[a;]2 . . . with exponent less than zero. These bits with exponent less than zero are approximated as 
being completely corrupted by noise, whereas the bits with higher exponent are approximated as being 
received noise free. 

This suggests to approximate the Gaussian point-to-point channel © by a deterministic channel between 
the binary input vector 



and the binary output vector 
connected through the channel operation 
with 

H 



y = {yi 1/2 ... i/n)"^ 



y = Hx, 



(5) 
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(a) Signal-strength deterministic model (b) Lower-triangular deterministic model 

Fig. 2. Permissible signaling schemes for both deterministic models. 




(a) Signal-strength deterministic model (b) Lower-triangular deterministic model 

Fig. 3. Illustration of a signaling scheme that succeeds for the lower-triangular model (assuming the subspace condition Q holds), but 
fails for the signal-strength model. 



as depicted in Fig. |l(b)[ Here, we have normalized the received vector y to contain the bits from 1 to n. 
This is a deterministic channel with finite input and output alphabets. Note that all operations in ([5]) are 
over Z2, i.e., modulo two. Similarly, the Gaussian multiple-access channel ([3]) can be approximated by 
the deterministic channel model 

y = HiXi © H2X2. (6) 



C. Comparison of Deterministic Models 

We now compare the signal-strength deterministic model reviewed in Section III-AI and the lower- 
triangular deterministic model introduced in Section III-BI As an example, we consider the Gaussian 
multiple-access channel ^ with signal strength n = 4. The corresponding deterministic models are given 
by dH) and Assume that transmitter one wants to send three bits ai, 02, and 03 to the receiver. At the 
same time, transmitter two wants to send one bit 61. 

In both deterministic models dH) and Q, transmitter one can use the first three layers to send ai, 02, 
and 03, while transmitter two can use the last layer to send bi, as shown in Fig. [2l For the signal- strength 
model, the decoding scheme is trivial. For the lower-triangular model, the receiver starts by decoding the 
highest layer containing only Oi. Having recovered oi, the receiver cancels out its contribution in all lower 
layers. The decoding process continues in the same manner with 02 at the second-highest layer, until all 
bits are decoded. 

There are, however, some signaling schemes that are only decodable in the lower-triangular model, but 
not in the signal-strength model. An example of such a signaling scheme is depicted in Fig. |3l In this 
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scheme, transmitter one uses again the first three layers to send ai, 02, and 03. Unlike before, transmitter 
two now also uses the first layer to send bi. From Fig. |3(a)[ we can see that, in the signal- strength model. 



receiver one observes ai©6i and cannot recover ai and bi from the received signal. However, this scheme 
can be utilized successfully in the lower-triangular model as long as the subspaces spanned by the message 
bits at the receivers are linearly independent. In this case, the subspace spanned by the first three columns 
of Hi and the subspace spanned by the first columns of H2 need to be linearly independent. This is the 
case if and only if 
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^ 0. (7) 



The event (|7]) depends not only on n, but also on the bits in the binary expansion of hi and /i2. Thus, 
this scheme is successful for all channel gains (hi, h2) G (1, 2]^ \ B, where B is the event that (|7]) does 
not hold. The set B can be understood as an outage event: If the channel gains are in B, the achievable 
scheme fails to deliver the desired target rate of 4 bits per channel use. 

Noting that the scheme depicted in Fig. |2(b)| always works while the scheme depicted in Fig. |3(b)| only 
works under some conditions, one might question the relevance of the new class of solutions. The answer 
is that it is precisely this class of signaling solutions that are valid for the lower-triangular model but 
invalid for the signal-strength model that are efficient for the X-channel to be investigated in Section |IVl 

As pointed out earlier, the second step in using the deterministic approach is to transfer the solution 
for the deterministic model to a solution for the original Gaussian model. We now show how this can 
be done for the signaling scheme shown in Fig. |3(b)[ The proposed scheme for the Gaussian multiple- 
access channel is depicted in Fig. |4l In this scheme, the input constellation at transmitter one is the set 
{0, 1/8, ... , 7/8}, and the input constellation at transmitter two is the set {0, 1/2}. Since the additive 
Gaussian receiver noise has unit variance, we expect the receiver to be able to recover the coded input 
signals roughly when 

2'^\hiui + hiU2 - hiu'i - h2u'2\ > 2 (8) 

for all Ui.u'i G {0, 1/8, . . . , 7/8}, ^2, ^2 G {0, 1/2} such that (mi,'U2) 7^ {u'l.u^). In words, we require 
the minimum constellation distance as seen at the receiver to be greater than two. 




2"(hiui +h2U2) 



Fig. 4. Modulation sclieme for the Gaussian model suggested by the signaling scheme for the lower-triangular deterministic model depicted 
in Fig. |3(b)| At the decoder, blue dots correspond to input tuples (iti, 0) with iti G {0, 1/8, . . . , 7/8}, and red dots correspond to input 
tuples (iti, 1/2) with ui G {0, 1/8, ... , 7/8}. Here, n = /ii = 1 and = 1/6. 



We note that condition ([8]) for the Gaussian channels corresponds to condition ([7]) for the deterministic 
model. As in the deterministic case, this scheme fails to work whenever the channel gains are in the set B 
not satisfying ([8]), and one can bound the Lebesgue measure of this outage event B. It is worth emphasizing 
that condition ([8]) has nothing to do with the rationality or irrationality of the channel coefficients. 

Remark: In the special case in which each transmitter has the same message size, the modulation 
scheme shown in Fig. |4]is the same as the modulation scheme used in real interference alignment [)3l|, 
ifTTl . The objective in ^ is to achieve only the degrees-of-freedom of the channel, and therefore the 
modulation scheme there is designed and calibrated for the high SNR regime. As a result, the modulation 
scheme in ISJ is not sufficient to prove a constant-gap capacity approximation. Rather, as we will see 
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in Section |Wl asymmetric message sizes and judicious layer selection guided by the proposed lower- 
triangular deterministic model together with a more careful and more general analysis of the receivers are 
required to move from the degrees-of-freedom to the constant-gap capacity approximation. 

III. Network Model 

In the remainder of the paper, we focus on the X-channel. This section formally introduces the channel 
model and capacity definitions. We start by introducing notational conventions in Section ITII-AI We intro- 
duce the symmetric X-channel in Section IIII-Bl and the general asymmetric X-channel in Section |III-C[ 

A. Notation 

Throughout this paper, we use small and capital bold font to denote vectors and matrices, i.e., x and H. 
For a real number a G R, we use (a)+ to denote max{a,0}. For a set i? C H^, fJ^^B) = fid{B) denotes 
(i-dimensional Lebesgue measure. For an event A, 1a denotes the indicator function of A. Finally, all 
logarithms are expressed to the base two. As a result, all capacities are expressed in bits per channel use. 

B. X-Channel with Symmetric SHRs 

The Gaussian X-channel with symmetric SNRs consists of two transmitters and two receivers. The 
channel output ym\t\ at receiver m G {1,2} and time t G IN is 

[t] ^ T'h^iXi [t] + T'hm2X2 [t] + [t] , (9) 

where Xk[t\ is the channel input at transmitter k G {1, 2}, where hmk is the (constant) channel gain from 
transmitter k to receiver m, and where Zm[t] ~ A/'(0, 1) is additive white Gaussian receiver noise. The 
channel gains consist of two parts, 2" and hmk- We assume here that n G Z+, and that hmk G (1, 2] for 
each m, k. Thus we see that 2^" captures the approximate SNR and hmk captures the fine structure of the 
channel gains. Since each channel gain is multiplied by the same number 2", each of the four links has 
the same approximate SNR — the general case with asymmetric SNRs is discussed in the next section. 

Each transmitter has one message to communicate to each receiver. So there are a total of four mutually 
independent messages Wmk with m, A; G {1, 2}. The message Wmk is uniformly distributed in {1, ... , Wmk}- 
An encoder of block length T at transmitter A; is a function 

/fc:{l,...,M/ife}x{l,...,W^2fc}^H^ 
mapping the messages wik and W2k into the sequence of channel inputs 

{Xk[t])'^=l = fk{Wik,W2k)- 

The rate of the encoder fk for message Wmk is 

The encoder is said to satisfy a unit average power constraint if 

t=i 

for every wi^, W2k- A decoder of block length T at receiver m is a function 

0™:R^^{l,...,iy^i}x{l,...,iy„2} 
mapping the sequence of channel outputs (ymM)^i into the message estimates 

{Wml,Wm2) = 4>m{{ym[t])J'=l) 
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of the messages {wmi, Wna)- The probability of error of a pair of encoders (/i, and decoders (0i, • 
is 

m,fc 

Definition. A sum rate R{n) is achievable, if for every e > and every T > Tq^s) there exists encoders 
(/17 12) with block length T, unit average power constraint, and rates ^^^^Rk > R{n), and there exists 
decoders ^2) with block length T and probability of error less than e. The sum capacity C{n) of the 
Gaussian X-channel ^ is the supremum of all achievable sum rates R{n). 

In the following, we will be interested in a particular modulation scheme for the Gaussian channel, 
which we describe next. Fix a time slot t\ to simplify notation, we will drop the dependence of variables 
on t whenever there is no risk of confusion. Assume each message Wmk is modulated into the signal Umk 
of the form 



^mk 



j=3 



with [umk]i G {0, 1}. Transmitter one forms the channel input 

Xi = h22Uu + hi2U21. 

Similarly, transmitter two forms the channel input 

X2 = hiiU22 + ^2lMl2- 

Since fi^^^j. < 4 and u^^^. < 1/16, this satisfies the unit average power constraint at the transmitters. 
The channel output at receiver one is then 

= 2"/lii/l22^^11 + 2"/li2/i21^^12 + 2"/Iii/Ii2(m21 + ^i22) + Zi, 



(10) 



(11a) 



(lib) 



(12) 



and similar at receiver two. Receiver one is interested in the signals uu and U12. The other two signals 
U21 and U22 are interference. We see from (fT2l) that the interfering signals U21 and U22 are received with 
the same coefficient /ii 1/^12 • 

It will be convenient in the following to refer to the effective channel gains including the modulation 
scheme as gmk, i-e.. 



910 = ^11^12; 

911 — ^11^22, 



9l2 



il2"'21; 



920 = ^22^21; 
^721 — ^21^12, 
^722 — ^22^11- 



(13a) 
(13b) 
(13c) 



Here gmk for m,k E {1,2} corresponds the desired signal Umk, and gmo for m G {1,2} corresponds to 
the interference terms. Since G (1,2], we have g^k G (1;4]. 

As in the discussion in Section III-B[ it is insightful to consider the lower-triangular deterministic 
equivalent of the modulated Gaussian X-channel (fT2l) . To simplify the discussion, we assume for the 
derivation and analysis of the deterministic channel model that the channel gains gmk are in (1, 2] — the 
Gaussian setting will be analyzed for the general case. Let 
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be the deterministic channel matrix corresponding to the binary expansion of the channel gain gmk with 
m E {1,2} and k E {0, 1, 2}. Since gmk £ (1; 2] by assumption, the diagonal values of Gmk are equal to 
one. The lower-triangular deterministic equivalent of the modulated Gaussian X-channel is then given by 

yi = GuUn © G12U12 © Gio{u2i © U22), (15a) 
y2 = G22U22 © G21U21 © G2o(*ii2 © uii), (15b) 

where the channel input Umk and the channel output ijm are all binary vectors of length n, and where all 
operations are over Z2EI 

As before, transmitter one has access to the messages wii,W2i, and transmitter two has access to 
W12, W22- The four messages are mutually independent, and Wmk is uniformly distributed over {1, . . . , Wmk}- 
Each transmitter k consists of twcH encoders fmk of block length T, with 

fmk : {1, • • • , Wmk} ^2 

mapping the message Wmk into the sequence of channel inputs 

i'^mk[t])'t=l — frnki^mk)- 

The rate of the encoder fmk is 

5 A ^OgjWmk) 
^mk rp 

A decoder of block length T at receiver m is a function 

(t>m ■■ Zr"" ^ {1, • • • , Wml] X {1, ... , Wm2] 

mapping the sequence of channel outputs {ym[t\)'t=i into the message estimates 

{Wml,Wm2) = 0m((^mM)Ll) 

of the messages {wmi, uim2)- The probability of error of a pair of encoders (/i, /2) and decoders ^2) 
is 

7^ Wmk})- 

m,k 

Definition. A sum rate R(n) is achievable, if for every e > and every T > To{e) there exists encoders 
(/ii; /12, /217 /22) with block length T, unit average power constraint, and rates J2mk^k > Rin), and 
there exists decoders ^2) with block length T and probability of error less than e. The sum capacity 
C{n) of the (modulated) deterministic X-channel (fTS]) is the supremum of all achievable sum rates R(n). 

'This definition of the deterministic model corresponds to a power constraint of 16 in the Gaussian model. This is mainly for convenience 
of notation. Since the additional factor 16 in power only increases capacity by a constant number of bits per channel use, this does not 
significantly affect the quality of approximation. 

^Observe that in the definition of capacity C{n) of the modulated deterministic X-channel l llSt we use two encoders at each transmitter 
(one for the each of the two messages). This differs from the definition of capacity C(n) of the Gaussian X-channel l|9), where we use 
a single encoder. Thus, in the deterministic case, we force the messages to be encoded separately, while we allow joint encoding of the 
two messages in the Gaussian case. This restriction is introduced because the aim of the deterministic model is to better understand the 
modulated Gaussian X-channel ill) , which already handles the joint encoding of the messages through the modulation process. 
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C. X-Channel with Arbitrary SNRs 

In the last section, we introduced the Gaussian X-channel with SNR of order 2^" over each of the four 
links. Thus, all links had approximately the same strength. We now introduce the Gaussian X-channel 
with arbitrary SNRs. The channel output at receiver m E {1, 2} and time t G IN is then 

^ 2"-i/i^iXi[t] + 2"™^/i„2X2[t] + Zrn[t], (16) 

where Xk[t] is the channel input at transmitter k E {1,2}, where 'I""^^hmk is the channel gain from 
transmitter k to receiver m, and where Zm^] ~ A/'(0, 1) is additive white Gaussian receiver noise. The 
channel gains consist again of two parts, 2"™*= and hmk- We assume that rimk £ ^+ and that h^k £ (1,2] 
for each m, k. Since 2^"^'^hmk varies over (2"™*, 2"™*+^] when hmk varies over (1, 2], we see that any real 
channel gain greater than one can be written in this form. As discussed in Section III-A[ channel gains 
with magnitude less than one are not relevant for a constant-gap capacity approximation, and hence are 
ignored here. Similarly, negative channel gains have no effect on the achievable schemes and outer bounds 
presented later, and are therefore ignored as well. In other words. Equation (fT6l ) models essentially the 
general Gaussian X-channel. As before, the parameter rimk captures the magnitude or coarse structure of 
the channel gain, the parameter hmk captures the fine structure of the channel gain. Denote by 

\n2i n22 J 

the collection of channel parameters. 

Each transmitter has again one message to communicate to each receiver. So there are a total of 
four messages, denoted by Wmk for m,kE {1,2}. We impose a unit average power constraint on each 
transmitted signal. Define the sum capacity C{N) of the Gaussian X-channel with arbitrary SNRs in 
analogy to the symmetric case. 

We use the same modulation scheme (fTOl ) as in Section IIII-BI to map the message Wmk into the real 
number Xmk- Transmitters one and two form the channel inputs 

Xi = h22Uu + hi2U21, 
X2 = huU22 + h2lUi2, 

The received signals are then given by 

yi = 2""/lii/l22Mll + 2""/li2/l2lMl2 + /^ll /il2 (2"" M21 + 2""M22) + Zi, 
y2 = 2"22/l22/illM22 + 2"^^/li2/l2lM21 + /i22/i21 (2"^^Ml2 + 2''''Un) + ^2- 

Writing g^k for the product of the channel gains as defined in (fT3] ). we can rewrite this as 

yi = T^'giiu^i + T^'gi2Ui2 + (7io(2""u2i + T''^U22) + ^i, 
y2 = 2"^^(722^^22 + 2"^^(72iM2i + (72o(2"^^ni2 + 2"^inn) + ^2- 

To gain intuition for the modulated Gaussian X-channel, we develop again the corresponding lower- 
triangular deterministic channel model. To simplify the presentation, we focus in the following on the 
case where the direct links are stronger than the cross link^ i.e., 

min{nii,n22} > max{ni2, n2i}. 

^This assumption is only made for ease of exposition. Since the labeling of the receivers is arbitrary, all results carry immediately over 
to the case min{ni2, n2i} > max{nii, 7122 }■ The models and tools developed in this paper for these two cases can be applied to the other 
cases as well. 
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Following the derivation in Section IIII-Bl we form the deterministic channel with inputs Umk ^ ^2'°'' 
for m,k E {1,2} and with channel outputs ym G Z2""" for m E {1,2}. It will be convenient to split the 
channel input into "common" and "private" portions, i.e., 



■mk 
P 

mk 



where E 11^^ and 1*^2 ^ ^2^^ ^ {I52}. The deterministic channel connecting Umk with y„ 

is then given by 



t/i = Giinii © G12 ( -c 
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(17a) 



(17b) 



see Figs. [5] and [6l Here, the lower-triangular binary matrices Gmk are defined in analogy to (fT4l) . The 
matrix Gi^ is of dimension x rin and G2fc is of dimension n22 x 71,22 for all A; G {0, 1, 2}. As in the 
symmetric case, we assume for the deterministic model that Qmk G (1,2]. As a consequence, the diagonal 
elements of Gmk are equal to one. 



"11 - "21 



Wll 



wf2 



-P 
"21 



"22 



nil — ni2 



ni2 



Fig. 5. Deterministic model at receiver one. For simplicity, the matrices Gmk are omitted. The interference terms U21 and U22 are observed 
at receiver one multiplied by the same matrix Gio. The desired terms un and U12 are multiplied by different matrices Gn and G12, 
respectively. 
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Fig. 6. Deterministic model at receiver two. The matrices Gmk are again omitted. The interference terms uw and 1x12 are observed 
at receiver one multiplied by the same matrix G20- The desired terms U21 and U22 are multiplied by different matrices G21 and G22, 
respectively. 

The definition of an achievable sum rate R{N) and of capacity C{N) for the deterministic X-channel 
with arbitrary SNRs is in analogy to the symmetric case. 



IV. Main Results 

The main result of this paper is a constant-gap approximation for the capacity of the Gaussian X- 
channel. To simplify the presentation of the relevant concepts and results, we start with the analysis of 
the Gaussian X-channel with symmetric SNRs in Section ITV-A[ We then consider the Gaussian X-channel 
with arbitrary SNRs in Section HV-BI 
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A. X-Channel with Symmetric SNR* 

We start with the analysis of the deterministic X-channel — as we will see in the following, the insights 
obtained for this model carry over to the Gaussian X-channel. The capacity C{n) of the symmetric 
deterministic X-channel is characterized by the next theorem. 

Theorem 1. For every 6 G (0, 1] and n G Z+, there exists a set Bn C (1, 2]^^^ of Lebesgue measure 

such that for all channel gains (gmk) G (1, 2]^^^\-B„ the sum capacity C{n) of the (modulated) symmetric 
deterministic X-channel (fTSi) satisfies 

|n-21og(ci/(5) < C{n) < |n 

for some finite positive universal constant C\. 

Theorem [U is a special case of Theorem [5] presented in Section IIV-BI We hence omit the proof of 
Theorem [H 

Theorem [T] approximates the capacity of the modulated deterministic X-channel (fT5l) up to a constant 
gap for all channel gains g^k G (1)2] except for a set i?„ of arbitrarily small measure. As will become 
clear in the following, the event ((^mfc) G -Bn can be interpreted as an outage event, as in this case the 
proposed achievable scheme fails to deliver the target rate of \n — 21og(ci/5). Here b parametrizes the 
trade-off between the measure of the outage event and the target rate: decreasing b decreases the measure 
of the outage event B^, but at the same time also decreases the target rate |n — 21og(ci/5). We point 
out that b can be chosen independently of the number of input bits n, hence the approximation gap is 
uniform in n. 

Theorem [T] can be used to derive the more familiar result on the degrees-of-freedom of the deterministic 
X-channel. Setting b = n^"^ results in the measures fi{Bn) < n^'^ to be summable over n G Applying 
the Borel-Cantelli lemma yields then the following corollary to Theorem [T] 

Corollary 2. For almost all channel gains (gmk) ^ (1,2]^^'^ the sum capacity C{n) of the (modulated) 
symmetric deterministic X-channel (fTSi) satisfies 

lim = 4/3. 

n— >-oo n 

Corollary |2] states that for almost all channel gains the deterministic X-channel has 4/3 degrees-of- 
freedom. We emphasize that, while Corollary [2] is simpler to state and perhaps more familiar in form. 
Theorem [T] is considerably stronger. Indeed, Theorem [T] provides the stronger constant gap capacity 
approximation for the sum capacity C{n), whereas Corollary [2] provides the weaker degrees-of-freedom 
capacity approximation. Moreover, Theorem [T] provides bounds for finite n on the measure of the outage 
event whereas Corollary [2] provides only asymptotic information about its size. 

We now describe the communication scheme achieving the lower bound in Theorem [T] (see Fig.|7]). Use 
the first R components of each vector Umk to transmit information, and set the last n — R components 
to zero. The rate of this communication scheme is hence 4_R. Receiver one is interested in uu and Ui2. 
These vectors are received in the subspace spanned by the first R columns of Gn and Gu, respectively. 
On the other hand, the messages U21 and 1*22 that receiver one is not interested in, and that can hence be 
regarded as interference, are both received in the same subspace spanned by the first R columns of Giq. 
Thus, the two interference vectors are aligned in a subspace of dimension R. The situation at receiver 
two is similar. 

Assume at receiver one the three subspaces spanned by the first R columns of Gu, G12, and Gio were 
linearly independent. Then the receiver could recover the two desired vectors by projecting the received 
vector into the corresponding subspaces in order to zero force the two interfering vectors. We show that 
for most channel gains this linear independence of the three subspaces holds for R ^ n/3. The outage 
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event i?„ is thus precisely the event that at either of the two receivers the three subspaces spanned by the 
first R columns of Gmi, Grn2, and Gmo not linearly independent. 

The results for the deterministic X-channel suggest that the modulation scheme (fTTI) should achieve a 
sum rate of 

3R ^ |n±0(l) 

as n — 7- oo. Furthermore, it suggests that a n-hit quantization of the channel gains hmk available at both 
transmitters and receivers should be sufficient to achieve this asymptotic rate. This intuition turns out to 
be correct, as the next theorem shows. 

Theorem 3. For every 6 E (0, 1] and n G Z+, there exists a set Bn C (1,2]^^^ of Lebesgue measure at 
most 

KBn) < 6 

such that for all channel gains {hmk) G (1,2]^^^ \ En the sum capacity of the symmetric Gaussian 
X-channel ^ satisfies 

|n - 2 log(c2/5) < C{n) < |n + 4 

for some finite positive universal constant €2- Moreover, the lower bound is achievable with a n-bit 
quantization of the channel gains hmk available at both transmitters and receivers. 

Theorem [3] is a special case of Theorem [6] presented in Section IIV-BI We hence omit the proof of 
Theorem [3l 

Theorem [3] provides a constant-gap capacity approximation for the symmetric Gaussian X-channel. 
The constant in the approximation is uniform in the channel gains h^k ^ (1;2] except for a set _B„ of 
arbitrarily small measure, and uniform in the power constraint n. The event Bn can again be interpreted 
as an outage event, and 5 parametrizes the trade-off between the measure of the outage event Bn and the 
target rate of the achievable scheme. Since 5 can be chosen independently of n, the approximation gap 
is uniform in the SNR, i.e., uniform in 2^". 

Remark: It is worth pointing out that the set Bn can be explicitly computed: Given channel gains hmk, 
there is an algorithm that can determine in bounded time if these channel gains are in the outage set 
More precisely, Bn is the union of 2®*^") diagonal strips (see the proof of Lemma [14] in Section IVII-BI 
and Fig. [17] there). Membership of (hmk) in the outage set Bn is mostly determined by the 6(n) most- 
significant bits in the binary expansion of the channel gains hmk- In particular, for any finite n (and hence 
finite SNR), the question of rationality or irrationality of the channel gains hmk is largely irrelevant to 
determining membership in 

The theorem shows furthermore that the proposed achievable scheme for the Gaussian X-channel is 
not dependent on the exact knowledge of the channel gains, and a quantized version, available at all 
transmitters and receivers, is sufficient. In fact, the scheme achieving the lower bound uses mismatched 
encoders and decoders. The encoders perform modulation with respect to the wrong channel model 

ym[t] = 2''hmlXi[t] + 2''hm2X2[t] + Zm[t], (18) 
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where hmk is a ra-bit (or, equivalently, | log(SNR)-bit) quantization of the true channel gain hmk- In other 
words, the channel inputs are 

Xi[t] = h22Uu[t] + hi2U2l[t], 
X2[t] = huU22[t] + h2lUi2[t]. 

The decoders perform maximum-likelihood decoding also with respect to the wrong channel model (fTSl ). 
Thus, both the encoders and the decoders treat the channel estimates as if they were the true channel 
gains. This shows that the proposed achievable scheme is actually quite robust with respect to channel 
estimation and quantization errors. 

As before, we can use Theorem |3] to derive more familiar results on the degrees-of-freedom of the 
Gaussian X-channel. Consider a sequence of SNRs 2^" indexed by n G Z+, and set 6 = n^^. Then the 
measures fx{Bn) < n^"^ are summable over n E Z+. Applying the Borel-Cantelli lemma as before yields 
the following corollary to Theorem |3l 

Corollary 4. For almost all channel gains {hmk) ^ (1,2]^^^ the sum capacity C{n) of the symmetric 
Gaussian X-channel ^ satisfies 

lim = 4/3. 

n— >cjo n 

Since the SNR of the channel is approximately 2^" so that ^ ~ | log(SNR), Corollary |4] shows that the 
Gaussian X-channel has 4/3 degrees-of-freedom for almost all channel gains hmk, recovering the result 
in [i3|. We emphasize again that Theorem [3] is considerably stronger than Corollary |4l Indeed, Theorem [3] 
proves the constant-gap capacity approximation 

\C{n) - ln\ < 0(1) 

with pre-constant in the 0(1) term uniform in the channel gains hmk outside B„. This is considerably 
stronger than the degrees-of-freedom capacity approximation in Corollary IH which shows only that 

\C{n) - |n| < o(n) 

with pre-constant in the o{n) term depending on h^k- Moreover, Theorem [3] provides bounds on the 
measure of the outage event for finite SNRs, not just asymptotic guarantees as in Corollary IH 



B. X-Channel with Arbitrary SMRs 

In the last section, we considered the Gaussian X-channel with SNRs across each link of order 2^". 
Thus, all links had approximately the same strength. We now turn to the Gaussian X-channel with arbitrary 
SNRs. As before, we start with the analysis of the deterministic X-channel. The next theorem provides 
an approximate characterization of the sum capacity C{N) of the general deterministic X-channel with 
bit levels N. 

Theorem 5. For every 5 G (0, 1] and N G Z^^^ such that min{nii,n22} > max{ni2, ?t.2i} there exists a 
set B C (1,2]^^^ of Lebesgue measure 

fi{B) < 6 

such that for all channel gains {gmk) £ (1? 2]^^'^ \ B the sum capacity C{N) of the (modulated) general 
deterministic X-channel (flTl) satisfies 

D{N) -2\og{ci/6) < C{N) < D{N) 

for some finite positive universal constant C\, and where 

D{N) ^ min {D,{N), L»2(iV), D^iN), D,{N)} + (r^ - n2i) + (^22 - n^) 
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and 

Di{N) = {ni2 + n2i - nu)^ + {n^ + ^21 - n22)^, 
D2{N) = i(ni2 + + (ni2 + ^21 - n22)+), 
D3{N) = i(ni2 + ^21 + (ni2 + ^21 - nii)+), 
D4(Ar) = 1(^12 + r22i). 

The proof of Theorem [5] is presented in Section |Vl For the special case of symmetric channel SNRs, 
TT-mk = n for all m, k. Theorem [5] reduces to Theorem [T] in Section HV-Ai 

We now provide a sketch of the communication scheme achieving the lower bound in Theorem [5] (see 
Figs. [8] and (9]). Observe that the riu — 7221 least significant bits u^^ of uu are not visible at the second 



"21 



1 


Rii 












R21 




R22 


1 
















Rii 













Itll U12 U21 U22 



Fig. 8. Allocation of bits at receiver one. Here, un and urz are the desired bits and are received multiplied by the matrices Gn and G12 
(not shown in the figure), respectively. The vectors 1121 and 1122 are interference and are both received multiplied by the same matrix Gio- 
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Fig. 9. Allocation of bits at receiver two. Here, U21 and U22 are the desired bits and are received multiplied by the matrices G21 and G22 
(not shown in the figure), respectively. The vectors u\\ and 1112 are interference and are both received multiplied by the same matrix G2o- 



receiver. Therefore, we can use these bits to privately carry riu — n2i bits from the first transmitter to the 
first receiver without affecting the second receiver. The rate of this private message is denoted by R^^. 
The remaining rate is denoted by R^^, i.e., 



where 



-Rii — Rii + Rii, 



-Rll = ^11 - ^^21- 



Hi 

Similarly, the n22 — ni2 least significant bits U22 of '1*22 are not visible at the second receiver. Therefore, 
we can use this part to privately carry n22 — ^12 bits from the second transmitter to the second receiver 
without affecting the first receiver. The rate of this private message is denoted by _R22- The remaining rate 
is denoted by ^22' i-^-' 

-R22 — -R22 + -^225 
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where 

R22 = T^22 - nu- 
ll remains to choose the values of R^i, R22, R12, and R21. Our proposed design rules are as follows. 

• We dedicate the R^i most significant bits of Uu to carry information from transmitter one to receiver 
one. 

• Similarly, we dedicate the R22 most significant bits of U22 to carry information from transmitter two 
to receiver two. 

• We always set the n22 — ?^2i rnost significant bits of U12 to zero. The next R12 bits of 1*12 carry 
information from transmitter two to receiver one. As shown in Fig. [9l this guarantees the (partial) 
alignment of U12 with Un at the second receiver. 

• We always set the nu — ni2 most significant bits of U21 to zero. The next R21 bits of U21 carry 
information from transmitter one to receiver two. As shown in Fig. [H this guarantees the (partial) 
alignment of U21 with U22 at the first receiver. 

Optimizing the values of the rates Rmk subject to the condition that both receivers can decode the 
desired messages, yields the lower bound in Theorem |5l The details of this analysis can be found in 
Section IV-Al 

Generalizing these ideas from the deterministic to the Gaussian model, we obtain the following constant- 
gap capacity approximation for the Gaussian X-channel with general asymmetric channel gains. 

Theorem 6. For every 5 G (0, 1] and N G Z^^^ such that minjnn, '^22} > max{?7,i2, ^21} there exists a 
set B C (1,2]^^^ of Lebesgue measure 

^i{B) < 6 

such that for all channel gains (hmk) G (1,2]^^^ \ B the sum capacity C{N) of the general Gaussian 
X-channel (fT6l) satisfies 

D{N) - 2 log(c2/5) < C{N) < D{N) + 4 
for some finite positive universal constant 02, and where 

D{N) ^ min {D,{N), D2{N), D:,{N), D,{N)} + (nn - n2i) + (^22 - nu) 

and 

Di{N) = {ni2 + n2i - nii)+ + {riu + ^21 - ^22)"^, 
D2{N) = i(ni2 + n2i + {riu + ^21 - 7^22)"^), 
D3{N) = i(ni2 + n2i + (nu + ^21 - ^^11)+), 
D,{N)^l{nu + n2i). 

Moreover, the lower bound on C{N) is achievable with a ma:x{nmk}-bit quantization of the channel 
gains hmk available at both transmitters and receivers. 

The proof of Theorem |6] is presented in Section |Vll For the special case of symmetric channel SNRs, 
nmk = n for all m,k. Theorem |6] reduces to Theorem [3] in Section |IV-A[ We point out that D{N) in 
Theorem |6] for the Gaussian X-channel is equal to D{N) in Theorem |5] for the deterministic X-channel. 
Thus, up to a constant gap, the Gaussian X-channel and its lower-triangular deterministic approximation 
have the same capacity. 

The lower bound in Theorem |6] is achieved by encoders and decoders that have access to only a 
max{?2mfc}-bit quantization hmk of the channel gains hmk- As before, the encoders and decoders are 
mismatched, in the sense that they are operating under the assumption that hmk is the correct channel 
gain. This shows again that the proposed communication scheme is quite robust with respect to channel 
estimation and quantization errors. 
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V. Proof of Theorem [5] (Deterministic X-Channel) 

This section contains the proof of the capacity approximation for the deterministic X-channel in 
Theorem [51 Achievability of the lower bound in the theorem is proved in Section IV-A[ the upper bound 
is proved in Section |V-B[ 

A. Achievability for the Deterministic X-Channel 

This section contains the proof of the lower bound in Theorem[5l Without loss of generality, we assume 
that > ^11- We use the achievable scheme outlined in Section IIV-BI (see Figs. [8] and |9] there). We 
want to maximize the sum rate 

Rii + + + R22 ~^ -^12 + R21, 

where 

Rkk + Rkk = Rkk 

is the total rate from transmitter k to receiver k. The constraint is that each receiver can solve for its own 
desired messages plus the visible parts of the aligned interference bits. 

If the subspaces spanned by the columns of Gmk corresponding to information-bearing bits of Umk are 
linearly independent, then there exists a unique channel input to the deterministic X-channel that results 
in the observed channel output. The decoder declares that this unique channel input was sent. The next 
lemma provides a sufficient condition for this linear independence to hold and hence for decoding to be 
successful. 

Lemma 7. Let 5 G (0,1] and N E 1?^'^ such that min{nii,n22} > max{ni2, "^,21}. Assume R^^,R^^, 
R12, R21, R22' R22 ^ ^+ satisfy 

R^, + max{i?2i, R22} + R12 + Rii < nn ~ log(32/5), (19a) 
l{max{ii,.,i?.„KCj>o} ■ ( max{i?2i, R'^2} + ^12 + ^ii) < n,2 - log(32/5), (19b) 

l{i?i2>o} • (^12 + ^n) < ^12 + ^21 - ^22, (19c) 

and 

Rg + max{i?i2, + i?2i + R22 < "22 - log(32/5), (20a) 

l{max{R,i,;?i„iiCj>o} " ( max{/?i2, R^A + R21 + R^) < ^^21 - log(32/5), (20b) 

l{i?2i>o} ■ (^21 + ^22) < ^12 + ^^21 - '^ii- (20c) 

Then the bit allocation in Section \IV-B\ for the (modulated) deterministic X-channel (flTI) allows successful 
decoding at both receivers for all channel gains (gmk) ^ (1,2]^^'^ except for a set B C (1,2]^^^ of 
Lebesgue measure 

fi{B) < 6. 

The proof of Lemma |7] can be found in Section IVII-AI 

We now choose rates satisfying these decoding conditions. For ease of notation, we will ignore the 
log(32/5) terms throughout — the reduction in sum rate due to this additional requirement is at most 
21og(32/(5). The optimal allocation of bits at the transmitters depends on the value ni2 + n2i. We treat 
the cases 

I: ni2 -1-7121 e [0,nii] 

II: ni2 + n2i G (^11,^22] 
ni: ni2 + n2i e (^22, riu + 1^22] 
IV: ni2 + n2i E {uu + \n22, §^^22] 

V: ni2 + ^21 G (1^22, nil + ^^22] 
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separately. Since ni2 + n2i < riu + by the assumption max{r2i2, ri2i} < niin{nii, 7222}, this covers 
all possible values of N. 



Case I (ni2 + ^21 < nu): We set 



^11 = nu 



n2i, 



R 
R 



p A 

22 — "^-22 — "^12, 

C A pC A n ^ 

22 ~ -'I'll 



12 



i?21 = 0. 



In words, we solely communicate using the private channel inputs u^^ and 1*22 • Recall that, by our 
assumptions throughout this section, max{ni2, 7^21} < ^11 < ^22- Hence, > and ^22 ^ 0' ^^'^ 
this rate allocation is valid. The calculation in Appendix |A] verifies that this rate allocation satisfies the 
decoding conditions (fT9l ) and (|20l) in Lemma |7l Hence both receivers can recover the desired messages. 
The sum rate is 



(nil - ?^2i) + (^22 - n 



12) = {ni2 + ^21 - nii)+ + (r2i2 + ^21 - ^22)^ + (?^ll - ^21) + (^22 " ^12) 

= Di{N) + {nil - n2i) + (^22 - ^^12) 



(21) 



where we have used that 



"-12 + «21 < nil < 'T-22- 

Case II (nil < ni2 + n2i < ^-22): We set 



P A 
11 



7^11 - ?^2l, 



dP a 

-r<22 — '"'22 — "'12, 

r9 dp 



•-22 



ni2 - R 



ll5 



pC A I? A o An 
J^ll — -K12 — -£L2i — U, 



as shown in Fig. [TOl Since ni2 + ^21 > nu, we have i?22 > 0, and hence this rate allocation is valid. 



Receiver One 



Receiver Two 



□ UdD 



Ull U12 U21 U22 



Uii Ui2 U21 



U22 



Fig. 10. Allocation of bits in case II. Here nu = 10, n22 = 13, ni2 = 8, n2i — 4. The transmitters send private messages at rates Rn — 6 
and _R22 = 5. Transmitter two sends a common message to receiver two at rate _R22 = 2. 

The calculation in Appendix |A] verifies that this rate allocation satisfies the decoding conditions (fT9l) 
and (|20l ) in Lemma |7] Hence both receivers can decode successfully. The sum rate is 

(ni2 + "21-7^11) + (rill - "21) + ("22 - "12) 

= {ni2 + n2i - nii)+ + {ni2 + ^21 - ^^22)"^ + {nu - "21) + (^22 - ni2) 
= Di{N) + {nil - "21) + (7^22 - "12) 

> D{N), (22) 
where we have used that 

''^12 + '"'21 < '"22- 
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Case III (n22 < Uu + ^21 < rin + 1^22): We set 



-^11 


A_ 


nil - ^21, 






22 


A_ 


^22 — "-12, 






R12 


A_ 


{ni2 + 2^21 


- nil 


- n22 


R2I 


A_ 


(^21 + 2ni2 


- nil 


- n22 




_A 


n2i — R22 — 


R21, 




-"-22 




ni2 - Ru - 


R12, 





as depicted in Fig. \TT\ Using 1212 + n2i > n22 and n22 > "11 > max{ni2, ^21}, it can be verified that 
Rn > and R9^ > 0, and hence this rate allocation is valid. 



Receiver One 




W12 



U21 



U22 



Receiver Two 



U12 



U21 



U22 



Fig. 11. Allocation of bits in case III. Here nn — 11, 7122 ~ 13, ni2 = 8, 7121 — 9. The transmitters send private messages at rates Rii = 2 
and R22 = 5. Transmitter one sends a common message to receiver one at rate Rii = 3. Transmitter two sends a common message to 
receiver two at rate R22 = 4. The rates over the cross links are R12 = 2 and R21 = 1. Observe that the interference terms are partially 
aligned at each receiver. 

The calculation in Appendix |A] verifies the decoding conditions (fT9l) and (l20l) in Lemma Ul The sum 
rate of this allocation is 

^12 + ?^21 = ("12 + "21 - "22) + ("12 + "21 - ^11) + ("11 - ^21) + ("22 " ^12) 

= {ni2 + n2i - nii)+ + (^12 + ^21 - ^22)^ + (^11 - "21) + (^22 - "12) 
= Di{N) + {nil - n2i) + (^22 - nu) 

> D{N), (23) 
where we have used that 

"12 + "-21 > ^22 > "11- 

Case IV {nil + ^"22 < "12 + ^^21 < §^22): We set 

-Rii = "11 - "21, 

^22 — "-22 — ■'^12, 

-R21 = [ni2 - \n22\ , 
R^2 = Rii = ["21 — 1^22], 



as shown in Fig. [I2l Note that 



and 



1-12 
dC a 

-n.22 — "22 — "-21, 

nil + \n22 < "12 + "-21 < "-12 + "11 

"11 + 1^22 < "12 + ^21 < nil + "21 
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Therefore, 

in22 < min{ni2,ri2i}- 
It follows that ^21, Ri2, and R^i axe nonnegative, and that this rate allocation is valid. 



(24) 



Receiver One 



Receiver Two 



QSSi n 



till 



Ui2 



U21 



U22 



Wl2 



U21 



U22 



Fig. 12. Allocation of bits in case IV. Here nn — 18, n22 ~ 26, n\2 = W21 = 16. The transmitters send private messages at rates — 2 
and B^2 = 10. Transmitter one sends a common message to receiver one at rate = 3, and transmitter two sends a common message to 
receiver two at rate .R22 = 10. The rates over the cross links are R12 = 3 and R21 = 3. In case IV, the interference terms are completely 
aligned at receiver two, but only partially aligned at receiver one. 

The calculation in Appendix |A] verifies the decoding conditions (fT9l) and (|20|) in Lemma |7l The sum 
rate of this allocation is at least 

(^12 + n2i-\n22) + {rill - ^21) + (?^22 - ^12) - 3 

= I (^12 + n2i + (ni2 + ^21 - Ti22)^) + (^11 - ^21) + (^^22 " ^12) - 3 
= D2{N) + (nil - n2i) + (^22 - ^12) - 3 

> D{N) - 3, (25) 

where we have used (|24)) . and where the loss of three bits is due to the floor operation in the definition 

of R21, R12, R-ii- 

Case V (1^22 < ni2 + n2i < riu + 71-22): We set 

Rii = ^11 - 1^21, 

^22 — ^22 — '"'12, 

R12 — Rii — |_|^2i — I'^iaJ ) 

jS A 73C A I 2 1 I 

-K2I — -K22 — [3^12 — ■3^121] , 

as shown in Fig. \T3\ Using |n22 < + ^21, we have 

2^12 - n2i > 3(n22 - n2i) > 0, (26) 
implying that R21 and R22 are nonnegative. Similarly, 

2n2i - ni2 > 3(^11 - 7212) > 0, (27) 

implying that ^12 and R^^ are nonnegative. Hence this rate allocation is valid. 

The calculation in Appendix |A] verifies the decoding conditions (fT9l) and (|20l ) in Lemma |7] The sum 
rate of this bit allocation is at least 



3(7212 + 7l2i) + (72ii-7l2i) + (7222 - ^^12) " 4 

= D4{N) + (Till - n2l) + (7122 - ^12) - 4 

> D{N)-A, 

where the loss of four bits is due to floor operation in the definition of ^12, -Rn, -R21, -^22- 



(28) 
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Receiver One Receiver Two 




Ull Ui2 U21 U22 Uii U12 U21 U22 



Fig. 13. Allocation of bits in case V. Here nn — 12, n22 = 13, ni2 — 12, 7121 = 9. The private messages to receiver one and two have 
rates Rii = 3 and R22 = 1. The remaining messages to receiver one have rate R12 — Rii = 2, and are both entirely aligned at receiver 
two. The remaining messages to receiver two have rate -R21 = .R22 = 5, and are both entirely aligned at receiver one. 



Combining dlD, (EH), (EH), (ESI), (EH), and accounting for the loss of 21og(32/5) in Lemma |7] shows 
that, assuming 77-22 > ^11, 

C{N) > D{N) - 4 - 2 log(32/(5) 
= D{N)-2\og{ci/6) 

with 

ci = 128. 

If rill > ^22, we can simply relabel the two receivers, and the same argument holds. This relabeling of 
receivers introduces the function D^^N) instead of D2{N) in the lower bound. Together, this concludes 
the proof of the lower bound in Theorem [51 ■ 

B. Upper Bound for the Deterministic X-Channel 

The section contains the proof of the upper bound in Theorem [51 We start with a lemma upper bounding 
various linear combinations of achievable rates for the deterministic X-channel. 

Lemma 8. Any achievable rate tuple {Ru, R12, -^21,-^22) far the (modulated) deterministic X-channel (flTl) 
satisfies the fallowing inequalities 

Ru + R12 + -R22 < max{nii, ni2} + (n22 - ?^i2)^, (29a) 

-Rii + R21 + R22 < max{?T.2i, ^22} + (nil - ^21)"^, (29b) 

-Rii + R21 + R12 < max{nii, ni2} + (n2i - r^n)"^, (29c) 

-R21 + R12 + R22 < max{n2i, n22} + {nu - ^122)"'', (29d) 

-Rii + -R21 + R12 + R22 < max{?T,i2, (rill - ^^21)^} + max{n2i, (^22 - ^12)^}, (29e) 

Rii + R21 + R12 + R22 < max{r;.ii, (^12 - n22y} + max{n22, ("-21 - ^ii)^}, (29f) 

2_Rii + _Ri2 + R21 + R22 < max{nii, ni2} + max{n2i, (n22 - ""-12)^} + ('^ii - ^^21)^, (29g) 

^11 + 2^12 + R21 + R22 < max{nii, 7212} + max{n22, (^21 - -n-ii)^} + {uu - ^22)^, (29h) 

-Rii + R12 + 2-R21 + -R22 < max{r;,22, ^21} + max{nii, (^12 - ^22)^} + (^21 - ^ii)^, (29i) 

^11 + R12 + R21 + 2^22 < max{ra22, ^21} + max{r2i2, (^n - r22i)^} + (^^22 - ^12)^, (29j) 
The proof of Lemma [8l is reported in Appendix [Bl Inequalities (I29al) - (l29fl) are based on an argument 



from [10, Lemma 4.2]. Inequalities (29g)-(29j) are novel. 

The upper bounds in Lemma [8l can be understood intuitively as multiple-access bounds in a channel 
where the receivers are forced to decode certain parts of the interference (see Figs. [51 and [6l in Section [TV-BI) . 
For example, inequality (|29al) corresponds to the multiple-access bound 

Rii + R12 + R22 ^ max{nii, ^12} (30) 
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at receiver one, combined with the inequality 

R22 < (1^22 - ni2y. 

Similarly, inequality (|29el ) corresponds to the multiple-access bound 

^11 + R12 + R22 < max{rai2, {uu - ^21)^} 
at receiver one, combined with the multiple-access bound 

R22 + R21 + Rii < max{n2i, (^22 - ^^12)^} (31) 



at receiver two. Finally, inequality ( [29g[ ) corresponds to the multiple-access bounds (|30|) and 

^11 < ('^ii - ^21)^ 

at receiver one, combined with the multiple-access bound (|3T1) at receiver two. The proof of Lemma [8] 
makes this intuitive reasoning precise. A detailed discussion of this type of cut-set interpretation can be 
found in L19J. 

We proceed with the proof of the upper bound in Theorem [5] for the deterministic X-channel. Under 
the assumption 

min{nii,n22} > max{?7,i2, n2i}, (32) 



the first four inequalities (|29a| )- (|29d|) in Lemma [8] yield the following upper bound on sum-capacity 

CiN) < |(ni2 + n2i) + (nil - ^21) + (^^22 - ^12) 

= D4{N) + {nn - riai) + (^22 - ^12). (33) 

Again using (|32l) . inequality (I29el) in Lemma [8] shows that 

C{N) < max{?2i2, Uu - ^21} + max{r22i, ^22 - n^} 

= (ni2 + ^21 - riiiy + {ni2 + ^21 - ^22)^ + (^ii - ri2i) + (^22 - n^) 

= Di{N) + (rill - r22i) + (^22 - ^12)- (34) 

Inequalities (I29dl) and ( 29g ) in Lemma [8] combined with (|32|) yield 

CiN) < \ (nil + ^22 + max{n2i, ^22 - ^12} + (rin - ^21)) 

= \ {ni2 + ^21 + (ni2 + ^21 - ^22)^) + {nil - ^21) + (^22 - '^12) 

= D2{N) + {nii-n2i) + {n22-ni2). (35) 

Similarly, from (|29c|) and ( 29jp in Lemma [8l 

C{N) < i(ni2 + n2i + (ni2 + n2i - nii)+) + (nn - n2i) + (n22 - ni2) 

= D3(A^) + (nii-n2i) + (n22-ni2). (36) 

The sum capacity is hence at most the minimum of the upper bounds (I33l)-(l36l). i.e., 

C{N) < mm{Di{N),D2{N),Ds{N),D4{N)} + (nn - n2i) + (n22 - ni2) 
= D{N), 

concluding the proof. ■ 

VL Proof of Theorem [6] (Gaussian X-Channel) 

This section contains the proof of the capacity approximation for the Gaussian X-channel in Theorem [6l 
Achievability of the lower bound in the theorem is proved in Section IVLAl the upper bound is proved in 
Section IVLBl 
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A. Achiev ability for the Gaussian X-Channel 

Here, we prove the lower bound in Theorem [6l For ease of exposition, we assume in most of the 
analysis that all channels gains hmk are exactly known at the two transmitters and receivers. The changes 
in the arguments necessary for the mismatched case, in which the transmitters and receivers have access 
only to a quantized version hmk of the channel gain /i^fc, are reported in Appendix O 

Recall that each transmitter k has access to two messages, wik and W2k- The transmitter forms the 
modulated symbol Umk from the message Wmk- From these modulated signals, the channel inputs 

Xi = h22Un + hi2U2l, 
X2 = h2lUi2 + huU22 

are constructed. 

We now describe in detail the modulation process from wik to uik- Each Umk is of the form 

Umk ^^['^mfc]i2 

i=3 

with [urnk]i e {0, 1}. Since h'f^^. < 4 and < 1/16, the resulting channel input Xk satisfies the unit 
average power constraint at the transmitters. Observe that 2"'^^Umk > 1; intuitively, this means that over 
the stronger direct link we only transmit information above the "noise level". 

In analogy to the achievable scheme for the deterministic channel, we only use certain portions of 
the bits [umk\i of the binary expansion of Umk\ the remaining bits are set to zero. The allocation of 
information bits depends on the channel strength N and is chosen as in the deterministic case described 
in Sections IIV-BI and IV-A[ and as illustrated in Figs. [8] and |9l In particular, the messages Umk are again 
decomposed into common and private portions, i.e., 

p c 

Wmfc = ^mk + '^mk- 

The choice of bit allocation from the deterministic model ensures that, at receiver one. 



2""Mn e {0,±1,.. 


.,±(2"" 


-1)}, 


(37a) 


2"-!.^ G{0,±1,.. 


.,±(2"^2 


-1)}, 


(37b) 


2"2^Mfi G {0,±1,.. 


.,±(2"" 


-1)}, 


(37c) 




.,±(2"i2 


-1)}, 


(37d) 



see Figs. [5] and |6] in Section IIII-CI The situation is analogous at receiver two. We denote by Rmk the 
modulation rate of Umk in bits per symbol in analogy to the deterministic case. 

From the bit allocation for the deterministic X-channel, we see that the private portions of the messages 
Ui2 and U21 are always zero, i.e., = ^21 = 0- To satisfy the power constraint (as discussed in the last 
paragraph), we impose that the two most significant bits of each common message are zero. For reasons 
that will become clear in the next paragraph, we also impose that the two most significant bit for each 
private message is zero. This reduces the modulation rate by at most 12 bits per channel use compared 
to the deterministic case. 

The channel output at receiver one is 

1/1 = 2""/inXi + 2"i2/ii2X2 + ^i 

= {hiih22T''u^i + /^i2/i2i2"^^M?2) + /ii2/^ii(2""M^i + ^''^^u^) + (/iia/^n 2"i2m^2 + -^i), 

and similarly at receiver two. The channel output is grouped into three parts. The first part contains the 
two desired signals un and wfg- The second part contains the interference signals and U22- Note that 
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these interference terms are received with the same coefficient hi2hii and are hence aligned. The third 
part contains noise zi and the private portion of the message U22- By construction, 

2"-W2'2 e [0, 1/4) 

so that 

|/ii2/iii2"i2mP2I <4- i < 1. (38) 

We will treat this part of the interference as noise. 
The demodulator at receiver one searches for 

Su — A till, 
S12 — A U12, 

that minimizes 

\yi - hiih22hl + ^12^2lSl2 + hi2hiiSio\- 

Observe that by (1371 ) the search over Smk can be restricted to be over integers with magnitude less than 
2"". The demodulator then declares the minimizing 2~"iisii and 2^^'^'^si2 as the demodulated symbols 
Mil and ni2, respectively, and discards siq. We point out that the demodulator decodes only the sum sio 
of the two interfering symbols, but not the individual interfering symbols themselves. The demodulator 
at receiver two works in analogy. 

We now analyze the probability of error of this demodulator. Let d be the minimum distance between 
any two possible noiseless received signals resulting from different values of (sio, sn, S12). The probability 
of error of the demodulator at receiver one is then upper bounded by 

V{uik ^ uik for k G {1,2}) < 2P(zi + \hi2hii2'''''ul2\ > d/2) 

< 2P(zi > d/2- 1), (39) 

where we have used (|38] ). 

We now lower bound the minimum distance d between the received signal generated by the correct 
(sii, S12, Siq) and by any other triple (s'n, s'12, s'lg). We have 



d= min \hiih22{sii - s[^) + hi2h2i{si2 - 8^2) + hi2hii{siQ - s' 



(sil,Sl2,Slo) 
7^(*ll'*12'*lo) 



10) 



= min biilsii - s'li) +5(12(512 - s'ls) +fi'io(sio - s'lo) I' 

(•Sll,.5l2,Slo) 
7^(Slli'«12)*lo) 

where gmk denotes the products of two hmk as defined in (fT3]) . The next lemma provides a sufficient 
condition for this minimum distance to be large at both receivers. 

Lemma 9. Let 5 E (0,1] and N E Z^^^ such that min{nii,n22} > max{ni2, ^21}. Assume R^i,Rii, 
R12, R21, R22' -^22 ^ ^+ S'^tisfy 

i?fi + max{i?2i, i?22} + R12 + Rii <nu-6- log(4368/5), 

h^..{R,,MO,}>o} ■ ( max{i?2i, R'^2} + R12 + Rii) < ni2 - 6 - log(4368/5), 

l{i?i2>o} ■ (^12 + ^n) < ^^12 + ^21 - ^^22 - 6, 



and 



Rg + max{Ri2, Rn} + R21 + R22 < ^22 - 6 - log(4368/5), 
l{max{ij2i,Ri2,i?f J>o} ■ ( max{i?i2, i?fi} + R21 + Rg) < n2i - 6 - log(4368/(5), 

l{i?2i>o} ■ (^21 + ^22) < ^^12 + ^21 - nil - 6. 
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Then the bit allocation in Section \IV-B\ applied to the Gaussian X-channel (fT6l ) results in a minimum 
constellation distance d > 32 at each receiver for all channel gains (/imfc) ^ (1,2]^^^ except for a set 
B C (1,2]^^^ of Lebesgue measure 

fi{B) < 6. 

The proof of Lemma |9] is reported in Section IVII-B[ Observe that, up to the constants, Lemma |9] is 
exactly of the same form as Lemma |7] in Section IV-AI for the deterministic X-Channel. 

Recall that we have chosen the same allocation of information bits in the binary expansion of Umk 
as in the deterministic case analyzed in Section IV-A[ Since the most significant bit of each Umk is 
zero, the binary expansion of Smk is also of the form analyzed there. Moreover, since the conditions 
in Lemma |9] used here are the same as the conditions in Lemma |7] used in the deterministic case, we 
conclude that Lemma |9] can be applied if we further reduce the rates to accommodate the constant 
6 + log(4368/5) in Lemma |9l This can be achieved for example by reducing the modulation rate by a 
further 3 + | log(4368/(5) < 9.25 + |log(l/(5) per symbol. Accounting for the 12 bits loss per channel 
use due to the power constraint, the sum rate of the modulation scheme is then 

J2 Rmk = D{N) - 12 - 4 • 9.25 - 4 ■ i log(l/5) - 4 

m,k 

= D{N) - 2 log(l/(5) - 53, (40) 

with D{N) as defined in Theorem [5] for the deterministic X-channel, and where the additional loss of 4 
bits results from rounding in the bit allocation for the deterministic scheme as discussed in Section IV-AI 
Lemma [9] yields then that d > 32 for all hmk except for a set B of measure at most 6. Together with 
(|39l ). this shows that the probability of demodulation error is upper bounded by 

2F{zi > d/2 - 1) < 2F{zi > 15) 

< exp(-15V2). 

As mentioned before, this upper bound on the probability of demodulation error is based on the 
assumption that both transmitters and receivers have access to hmk- The analysis in Appendix O shows 
that the only difference under mismatched encoding and decoding, in which the transmitters and receivers 
use max{nmA,}-bit quantized channel gains hmk instead of hmk, is a decrease in the minimum constellation 
distance d. In particular, by (|89| ) in Appendix O the probability of demodulation error with mismatched 
encoders and decoders is upper bounded by 

2P(zi > d/2 - 7) < 2P(zi > 9) 

< exp(-9V2). 

More generally, consider the correct decoding region, and label the incorrect decoding regions of one of 
the two desired symbols, say su, in increasing order of distance to the correct one starting from i > 1 
(see Fig. [14]) . By (l90l ) in Appendix O the probability of decoding to the ith such incorrect decoding 
region with mismatched encoders and decoders is upper bounded by 

F{zi > i{d - 8)/4 - 3) < P(zi >6i-3) 

< iexp(-(6^-3)V2). (41) 

(1,0,0) (1,0,1) (0,0,0) ' (1,1,0) ^ 

Fig. 14. Decision regions at receiver one. Here, we assume the correct decision is (sn, S12, sio) — (0, 0, 0). The figure depicts the region 
resulting in the incorrect decision sn — 1. This region is a union of intervals, with each interval corresponding to a decision (l,Si2,s'io) 
for some s'i2,Sio. See also Fig. |4|in Section Hl-CI 
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To achieve a vanishing probability of error for a fixed finite power P, we use an outer code over the 
modulated channel. Let Rmk denote the rate of this outer code from transmitter k to receiver m. The rate 
Rik achievable from transmitter k to receiver one can be lower bounded by following the first few steps 
in the proof of Fano's inequality: 

Rik = I{u]u) 

= H{u) - H{u I u) 

> Rik — H{u ^ u) — V{u 7^ u)H{u \u^u ^ ti) 

>Rik-l- P(w ^ u)H{u \u,u^u), (42) 
where we have used u and u for uik and uik, respectively. 

Remark: The proof of Fano's inequality continues by upper bounding 

H{u \ u,u ^ u) < Rik- 

This is the approach taken, for example, in [|3l, El- Since Rik itself depends on N, this last upper 
bound is not strong enough to obtain a constant-gap approximation of capacity. Instead, we show next 
that H{u \ u,u ^ u) can be upper bounded by a constant independent of N. This argument is one of the 
key steps in the derivation of the upper bound. 

We have 

P('U 7^ u)H{u \ u^u ^ u) = P('U 7^ u) P('U = q\u ^ u)H{u \ u = q,u ^ u). (43) 
Define the probability 

= P(m = q\u = q,u^u). 

Applying ^ Theorem 9.7.1], 

H{U \ u = q^U^u) = -^ Pq^q logpq^q 

<? 

i / , , / 

< 



- log ( (27re) ( ^ - ( Yl ^P<^(^u) + T2 

^ £=1 1=1 

<llog ((27re)(5^£V.w., + ^ 

^ £=1 



£=1 " ^ 

12. 

< ^ log(27re) + i log(e) ^ £V(£),,f (44) 



2 ' 2 



for any invertible map a into {1, . . . , 2^1'=}. 
Now, 

P(m = q,u = q,u u) 



P(-u = q,u ^ u) 
F{u = q)F{u = q \ u = q)F{u ^u\u = q,u = q) 
P(u 7^ m)P(-u = q \ u ^ u) 
F{u = q\u = q)lq^q 



2^ifeP(M ^ m)P(m = g I u 7^ n)' 
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Combining this with (l44l) yields 

H{u\u = q,ui^u)<- log(27re) + - log(e) o^.tp^- J mp^- ^ , ^ / x • 

I I l"-^^v[u u)r{u = q \ u f= u) 

Choose the map a such that cr(l) = q. Then this last expression can be simplified to 

Hiu \u = q,u^u)< llog(2.re) + ^ Me) gfg .^'J^^ = ^ ' = (45) 

Consider the decision regions at the decoder that result in the incorrect decision u = uik = q. This 
region is a union of intervals with each interval corresponding to a different triple is[-^^, 3^2, s[q), see 
Fig. [m The probability of decoding = Q can hence be upper bounded by the probability that the 
magnitude of the noise is larger than the distance between the noiseless channel output for the transmitted 
triple (sii, Si2, S12) and the closest of the intervals of the decision region resulting in uik = q. This 
probability is upper bounded by ((4T]) . showing that we can choose a such that, for £ >2, 

F{u = q\ u = a{i)) < 2F{zi > 6{i - 1) - 3) 
< exp(-(6£-9)V2), 

so that 

J2 = q\u = a{i)) <J2^^ exp(-(6£ - 9f/2) < 1. 

e=2 1=2 
Together with (gS]) and (gS]), this shows that 

P('U 7^ u)H{u \u,u ^ u) 

1 



< ^ P(m ^ u)V{u = q\u^u)(^- log(27re) + ^ log(e)p^ 



1 

< ^log(27re2). (46) 
Substituting (|46|) into (|42l) shows that the rate between transmitter k and receiver one is lower bounded 

by 

Rik>Rik-l-\\og{2Tie^) (47) 

for all {hmk) G (1, 2]^^^ except for a set B of measure at most 5. 

The same argument holds for the rates R2k at the second receiver with the same outage set B. This 
shows that, for channel gains not in B, we can achieve a sum rate of at least 

Y,^rnk>Y^ Rrnk - 4 - 2 log(27re2), 
m,k m,k 

and hence 

C{N) >Y,Rmk-^-2 log(27re2). 

m,fc 

Using (|40l) . this shows that, except for a set B of measure 5, 

C{N) > D{N) - 2 log(l/5) - 57 - 2 log(27re2) 
= Z}(Ar)-21og(c2/5), 
since D{N) = D{N) by definition, and with 

C2 ^ 27Te^ ■ 228-5. 

This concludes the proof of the lower bound in Theorem [6l ■ 



P('U 7^ u)F{u = q \ u u] 
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B. Upper Bound for the Gaussian X-Channel 

This section proves the upper bound in Theorem [6l We start with a lemma upper bounding various 
linear combinations of achievable rates for the Gaussian X-channel. 

Lemma 10. Any achievable rate tuple {Rn, Ru, R21, R22) for the Gaussian X-channel (fT6l) satisfies the 
following inequalities 

i?n + Rx2 + i?22 < I log(l + 22""/.2^ + 2^---h\^) + 1 log (1 + j , (48a) 



i?n + i?2i + i?22 < i log(l + 2^---h\^ + + 1 log {\ + j , (48b) 

i?n + i?2i ^Ri2<\ log(l + 22""/^^^ + 2^--^h\^ + 1 log (1 + ^-^^-^ j , (48c) 

i?21 + i?12 + i?22 < i l0g(l + 22"-/i^2 + 2'""/^2l) + I log (l + ^V^^J ' ^^^"^^ 



, 92niiL2 , , 92n22/,2 

/2n + i?2i + /2i2 + i?22 < i log (1 + 2^"-/^?, + Y^^^) + I log (1 + l'-^^h\, + - ' 



i?n + R21 + R12 + R22<\ log (1 + 2'-^^hl, + Y^^^;^) + \ log (1 + 2'"^^/^L + Y 



''21 



(48e) 

^^)+ilog(l + 2-/.L + YT2^^,,^^, 

(48f) 



2i?n + i?i2 + i?2i + R22<\ log(l + 22""/i2^ + 22"-/.22) + 1 log (1 + 22^-/^2^ + ^^^^^^^^ 



22"2i/i2^ 



+ 2i?i2 + i?21 +R22<\ log(l + 22"i^/i22 + 22""/i2^) + 1 log (l + 2''''^^hl^ + 22n,,/,2 



22"i2/;,2^ 



/2n + i?i2 + 2/?2i + R22<\ log(l + 22"-/i2^ + 2''^^^hl^) + 1 log (l + 22""/^?^ + ^^^^^^^ , 



22"" 



/2n + i?i2 + R21 + 2/222 < \ log(l + 22-- /.^^ + + i log (l + 2''^^^hl^ + ^^n^./^i^ 

+ llog(l+ ^ '^'Z\ )■ (48j) 

The proof of Lemma [TOl is reported in Appendix iDl Inequalities (|48al) - (|48fl) are from [fTOl Lemma 5.2, 
Theorem 5.3]. Inequalities (48g)-(48j) are novel. 

We proceed with the proof of the upper bound in Theorem [6] for the Gaussian X-channel. Note that, 

for Timk G ^+ and hmk G (1, 2], 

ilog (1 + 22"" /i2^ + 22"i2/i2^) < i log (1 + 4 ■ 22"" + 4 ■ 22"i2) 

< ilog (9max{l,22"",22"i2}) 
= \ log(9) + max{nii, ^12} 
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and 



< \ \og (5max{l,22"22-2ni2}j 

= ilog(5) + (n22 - riis)^. 

In a similar manner, we can upper bound the right-hand sides of all terms in Lemma [10] by quantities 
depending only on N . 
Under the assumption 

min{?2ii,n22} > max{ni2,n2i}, (49) 
the first four inequalities (|48a|) - (|48d| ) in Lemma [10] yield the following upper bound on the sum-capacity 

C{N) < |(ni2 + n^i) + (nn - n^i) + (^22 - n^2) + f log(45) 

< D^{N) + (nn - n2i) + (^22 - ^12) + 4. (50) 

Again using (149] ). inequality (|48e| ) in Lemma [10] shows that 

C{N) < max{ni2, nu - ^21} + max{?22i, n22 - ""-12} + log(9) 

= (ni2 + 7221 - nii)+ + (ni2 + 7221 - ^22)+ + (^n - n2i) + (^22 - ni2) + log(9) 

< Di{N) + {nn - n2i) + {1122 - ^^12) + 4. (51) 

Inequality (I48dl) and (48g) in Lemma [TO] combined with (]49]) yield 

C(-/V) < i (^22 + uxx + max{ra2i, n22 - ^12} + (^n - 7221)) + \ log(5^ ■ 9^) 

= 1(^^12 + ^^21 + ("-12 + «21 - ^22)"^) + ('^ll - ^21) + (^^22 - ^12) + \ log(5^ ■ 9^) 

< D2{N) + (mi - nsi) + (^22 - nis) + 4. (52) 
Similarly, from (|48cl) and (]48j]) in Lemma [JO] 



C(iV) < I (ni2 + ^21 + (ni2 + ^21 - nn)+) + (nn - ^21) + (^22 - ^12) + ^ log(5' ■ 9^) 

< D-i{N) + (nn - ^21) + (^22 - ^12) + 4. (53) 

The sum capacity is hence at most the minimum of the four upper bounds ([50])-(l53]). i.e., 

C{N) < min {D^{N), D^iN), Ds{N), D,{N)} + (nn - n^i) + (^22 - n^^) + 4 
= D{N)+A, 

concluding the proof. ■ 

VII. Mathematical Foundations for Receiver Analysis 

This section lays the mathematical groundwork for the analysis of the decoders used in Sections IV-AI 
and IVI-AI For the deterministic channel model, decoding is successful if the various message subspaces 
are linearly independent. Conditions for this linear independence to hold are presented in Section IVII-AI 
For the Gaussian case, decoding is successful if the minimum distance between the different messages 
as seen at the receiver is large. As we will see, this problem can be reformulated as a number-theoretic 
problem. Conditions for successful decoding in the Gaussian case are presented in Section IVII-BI 



32 



A. Decoding Conditions for the Deterministic Channel 

We start by analyzing a "generic" receiver (i.e., the bit allocation seen at either receiver one or two). 
To this end, we assume there are two desired vectors Ui and U2 and one interference vector no- The 
interference vector consists of two signal vectors that are aligned, and can therefore be treated as a single 
vector. These three vectors are multiplied by the lower triangular channel matrices Gi,G2, and Gq created 
via the binary expansion of the channel gains gi,g2,go as before. We assume that certain components of 
the vectors Uk are set to zero. Specifically, we consider vectors in the set 

W^{%G{0,ir VA;{0,1,2,3}: 

■Uoi = for i G {1, . . . , ni - no} U {rii - riQ + Rq + 1, . . . , ni} 
Uii = for i G {-Ri + 1, . . . , ni}, 

U2i = for z G {1, . . . , ni - ^2} U {ni - ^2 + -R2 + 1, • • • , ^1}, 
M3i = for z G {1, . . . ,r2i - ^,3}}. 

Here, Ui and are to be interpreted as the common and private messages from transmitter one, 
respectively. This situation is illustrated in Fig. [15] The next lemma states that the subspaces spanned by 



ni 



^1 



n2 



i?3 



R2 



Ro 



U2 



Uo 



Fig. 15. A generic receiver as analyzed in Lemma [TTI White regions correspond to zero bits; shaded regions carry information. Bits are 
labeled from 1 to n, starting from the top. 

the corresponding columns of Gk are linearly independent for most channel gains ((70, (?!, (72)- 
Lemma 11. Let no, rii, n2 G such that rii > uq > n2, and let Rq, Ri, R2, R3 G Z+. Define the event 
B{uo,Ui,U2,Us) = {{90,91,92) e (1,2]^ : GqUq © Gi{ui © U3) © G2U2 = O}, 

and set 

B= [J B{Uo,Ui,U2,U3)- 



{uo,ui,U2,U3)eU\{{0, 0,0,0)} 



If there exists 6 G (0, 1] satisfying 



then 



Ri + Ro + R2 + R3<ni- log(16/(5), 
Ro + R2 + R3<no-\ogil6/S), 

R2 + R3 < ^2, 

l^{B) < 6. 



Observe that B is the set of channel gains 90,91,92 such that the corresponding subspaces spanned 
by the selected columns of Gq,Gi,G2 are linearly dependent. Thus the lemma states that if the rates 
Rk satisfy certain conditions, then the subspaces under consideration are linearly independent with high 
probability. 
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The condition on the rates in Lemma [TT] can be interpreted as follows. Since the matrices Gk are lower 
triangular, the subspaces spanned by the last n columns of Gk are the same for all k E {0, 1, 2}. Thus, a 
necessary condition for the linear independence of the three subspaces is that the total number of possible 
nonzero components of Uki with i > rii—n + l and A; G {0, 1, 2} is at most n. By the structure of the set 
U, this condition can be verified by considering only three values of n, namely n E {no, ni, 77-2}. Thus, a 
necessary condition for the linear independence of the subspaces is 

R1 + R0 + R2 + R3 <ni, 
- (ni - no))+ + Ro + R2 + R3< no, 
(Ri - (ni - n2)y + (i?o - K - ^2))+ + R2 + R3< n2. 
After some algebra these three conditions can be rewritten equivalently as 

Ri + Ro + R2 + R3< ni, 
Ro + R2 + R3< no, 

R2 + R3< n2. 

Lemma [TT] shows that, up to a constant log(16/5) and for most channel gains (90,91,92), these necessary 
conditions are also sufficient for the linear independence of the subspaces. 

Before we provide the proof of Lemma [TH we show how it can be used to prove Lemma |7] in 
Section IV-AI We will apply Lemma \TT\ with S replaced by S/2 to guarantee that the outage event at 
each receiver has measure at most 5/2. 

We start by reformulating the conditions in Lemma [TT] for each receiver. Consider first receiver one in 
Lemma [7] The corresponding message rates in Lemma [TT] are given by 

^0 — max{_R2i, -^22)5 

R2 = R12, 
R3 — Rii- 

The choice of the bit levels in Lemma [TT! depends on the values of ^0 and R2. If Rq, R2 > 0, we 
need to set 

no = ni2, 
ni = nil, 

n2 = ni2 + n2i - n22, 
see Fig. [8] in Section IIV-BI The conditions in Lemma [TTj are then that 

R^i + max{i?2i, R'^2} + ^12 + ^11 < nil ~ log(32/5), (54a) 
max{i?2i, ^22} + R12 + Rii < ni2 - log(32/(5), (54b) 
R12 + ^11 < ni2 + ^21 - ^22- (54c) 

If Ro > 0, R2 = 0, then the second column in Fig. [T5]is empty, and hence the third condition in Lemma [TTI 
need not be verified. Formally, note that in this case the value of n2 is irrelevant to the decoding process. 
We may hence assume without loss of generality that n2 is equal to no (thus still satisfying no > n2). 
Thus only conditions (|54a| ) and (|54b| ) need to be checked. If Ro = R2 = 0, the values of no and n2 
are irrelevant to the decoding process, and we can assume them to be equal to ni (again still satisfying 
ni>no> ^2). Thus only condition (|54al) needs to be checked. 
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Consider now receiver two in Lemma |7] By symmetry, Lemma [TTl results in the decoding conditions 

Rg + max{i?i2, + R21 + R22 < ^22 - log(32/5), (55a) 

^{m..{R,,MO^}>o} ■ ( max{i?i2, + R21 + RI2) < n2i - log(32/<5), (55b) 

l{i?2i>o} ■ (^21 + ^22) < ^12 + "-21 - nil. (55c) 

Denote by Bi C the collection of triples {qiq, gu, gi2) such that decoding fails at receiver one. 
Similarly, define B2 C with respect to receiver two. Finally, let i? C R^ be the union of Bi and B2- 
If the two sets of decoding conditions (|54l) and (l55l) are satisfied, then Lemma [TT] shows that 

/i3(5™) < 5/2 

for m G {1,2}, where here and in the following we use the notation ji^ to emphasize the Lebesgue 
measure is computed in R'^. Then 

Ai6(5) < /U6(5i) +^6(^2) 

= /i3(5i) ■ /i3((l, 2f) + /i3(52) ■ /i3((l, 2]3) 

i.e., the collection of channel gains {gmk) £ R^^^ for which decoding fails is small. This concludes the 
proof of Lemma |7l 

It remains to prove Lemma [TTl 

Proof of Lemma [77]- Throughout this proof, we will use the notation n{u) to denote the smallest 
index i such that Ui = 1, with the convention that n(0) = +00. Observe that, by the assumptions on R^, 

Ri + R3 < ni, 
Ro + R3< no, 
R2 + R3 < ^2, 

which implies that 

max [Ri,ni — uq + Rq, rii — n2 + R2} < ""-i — Rs- (56) 

By the definition of U, this guarantees that, if Uk 7^ 0, then n{uk) < ^1 — -R3 for k E {0, 1, 2}. Moreover, 
since Gk is lower triangular with unit diagonal, we have n{GkUk) = n{uk). Together this implies that 

GqUq®Gi{ui®U^)®G2U2 = ^ (57) 

only if 

n(GoWo © GiUi © G2U2) > rii- R3. 
By the same argument, assuming ^ 0, (|57] ) can hold only if (uq, Ui, U2) 7^ (0, 0, 0). Defining the sets 

W = {{uke {0, 1}"^ VA;{0, 1, 2} : (uq, ^^i, ^2, 0) G U] 

and 

B'{uq,Ui,U2) = {{go,gi,g2) e (1,2]^ : n{GoUo © GiUi © G2U2) > ni - R3}, 
we hence have 

BCB'^ U B'{uo,ui,U2). 

{uo,ui,U2)eU'\{0,0,0) 

We can then upper bound fi{B) using the union bound 

KB)<fi{B')< Yl fi{B'{uo,Ui,U2)). (58) 

{ito,'Ui,'it2)GW'\(0,0,0) 
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We continue by analyzing each term in the summation on the right-hand side separately. 

Since we are integrating with respect to Lebesgue measure over {qq, gi, G (1, 2]^, we can equivalently 
assume that qq, gi, g2 are independent and uniformly distributed over (1, 2]. The bits in the binary expansion 
i[9k]i)'iZ-oo of these numbers are then binary random variables with the following properties. [gk]i = for 
i < — 1, [9k]o = 1' and {[gk]i)}=oo i-i-^- Bernoulli (1/2) (see, e.g., [|22l Exercise 1.4.20]). The matrix 
Gk is then constructed from these binary random variables. Note that this implies that the three matrices 
Go,Gi, G2 are independent and identically distributed. Recall that each Gk is lower triangular Toeplitz 
with unit diagonal. 

Fix a binary vector u and consider the product 

b = Gu 

for some G = Gk, u = Uk, and with addition again over Z2. We now describe the distribution of b. Since 
G is lower triangular with unit diagonal, bi = whenever 1 < z < n{u), and bn[u) = 1. In particular, 
n{b) = n{u). Moreover, the components hi for n{u) < z < rai are i.i.d. Bernoulli (1/2). 
Assume first that 

niuo) < n{ui) < n{u2) < 00. (59) 
The summand in (|58] ) can be written as 
li{B\u^, ui, U2)) = ^i^ouo = bi © 62 © b3)P(Gi^xi = b,)F{G2U2 = 62). 

(60) 

Each term in the summation is nonzero only if 

n{bi © 62 © ba) = n{uo), 
n{bi) = n{ui), 
n{b2) = n{u2). 

Since ni — .Ra + 1 is strictly larger than n{uk) for all k G {0, 1, 2} by (|56l ) and using (|59l ), the equality 
n(6i © 62 © bs) = u^uq) can hold only if u^Uq) = n(ui). If the conditions on the Uk and 6^ are satisfied, 
then 

P(GoWo = bi © 62 © b3)P(GiWi = 6i)P(G2W2 = ^2) = 2-("i-"("o»-("i-"("i))-("i-"(^2))_ 
Substituting this into (|60l) shows that 

fi{B'{uo,ui,U2)) < 2^^-("i 

whenever n(uo) = n(ui), and 

ll{B'{Uo,Ui,U2)) = 

otherwise. 

Assume more generally that (ui, U2, U3) 7^ (0, 0, 0). Then a similar argument shows that 

fi{B'{uo,Ui,U2)) < 2^='-"i+"^i^'="(^'=) (61) 

whenever there are two distinct indices k, k' achieving the minimum min^ n('Ufc), and 

^i{B'{uo,u,,U2)) =0 (62) 

otherwise. In particular, the set B'{uq, Ui,U2) has measure zero whenever at least two of the Uk are equal 
to zero. 

For > n'^, let 

U\n-,n+) = {ue {0, l}"i : Ui = \fi E {I, . . . ,ni - n-} U {ni - + 1, . . . ,ni}} \ {0} 
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ni — n 



ni 



Fig. 16. A vector u in the set W'(n White regions represent bits set to zero. 



as illustrated in Fig. [TH We can then rewrite (l58l) as 

M5')< J2 J2 Yl KB'{uo,UuU2)) 

uoGU'{no,no—Ro) ui£U' {ni,ni—Ri) U2&J' {;n2 ,n2— R2) 
uo£U'{no,no—Ro) U2eU'{n2,n2—R2) 

+ Y Y f^{B'{uo,Ui,0)). 

'itoGW'(no,no— -Ro) ui£U'{ni,ni—Ri) 

By (|62|) . the set B' has measure zero whenever there is only a single minimizing n{uk). Together with 
the assumption ni > uq > n2, this shows that we can restrict the lower boundaries of the sets U' in the 
various sums. For example 

Y Y Y f^{B'{Uo,Ui,U2)) 

uoGU'{no,no—Ra) uiGU'{ni,ni—Ri) U2&A' {112 ,n2— R2) 

= Y Y Y f^{B'{Uo,Ui,U2)), 

uoGU'{no,no—Rf)) uiGU' {no,ni—Ri) U2&J' (n2 ,112— R2) 

where we have changed U'(ni,ni — ^i) to U'{no, ni — Ri), and similarly for the other three summations. 
Together with (|6TI) this yields that 

fJ^{B') < n{uk) 
UQ&A' {nQ,no—RQ) ui£U' {no,ni—Ri) U2£U'{n2,n2—R2) 

_j_ "y ^ "y ^ 2^3-^11+111111^.^(11^.) 

ui£U'{n2,ni—Ri) U2&A' {n2 ,n2— R2) 
_|_ y ^ ^ ^ 2^3-'^l+iiiinfc n(afc) 

iioGZJ'(n2,no— -Ro) it2GW'(n2,n2— -R2) 
_j_ y ^ ^ ^ 2-^3-'^i+iiiiiifc n{uk) 

uo€:U'(no,no—Ro) ui£U'{no,ni—Ri) 

We consider each of the four terms in turn. 
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For the first term, we have 



uoGW{no,no— -Ro) iiieW(no,ni~_Ri) it2GW{n2,"2-J?2) 

ni 

^ 53 2 ^ l{mmfcn(iifc)=i} 

i=ni-no + l ttoeS'(no,n.o--Ro) iiiGW'(n(),ni-_Ri) U2eU' {n2,n2-R2) 

ni 

< E E E E 

j=ni-no+l uo€U'{ni-i+l,no-Ro) ui£U' {ni~i+l,ni-Ri) U2&J' (n2,n2-R2) 

Using that 

the right-hand side can be further upper bounded by 

^ 2"i~"'0+^?o— *+i . 2^i~*+i . . 2^3-'^i+« 

fi=ni— no+l 

2^0+fll+i?2+-R3-»10+2 2~* 
i=n\—no+l 

^ 2'f^0+^Ji+-R2+if3-ni+2 

We can upper bound the remaining three terms in a similar fashion, yielding 

IJ,(^B') < 2^3+*^ ^2^°~'~^^~'~^^~"^ + 2^^"^^^""^ -|- 2^"^"^2— "0 _j_ 2^o+^i~"ij 
< 16 ■ 2^''^™^^''^^°^^^^^^^"^'^''^^^""'^^ 

This shows that if 

Ri + Ro + R2 + R3<ni- log(16/5), 
Ro + R2 + R3<no-log{16/6), 

and (in order to guarantee (1561) ) if 

R2 + R3 < n2, 

then 

fi{B) < fi{B') < 6, 

completing the proof of the lemma. 



B. Decoding Conditions for the Gaussian Channel 

In this section, we analyze a "generic" receiver for the Gaussian case. To this end, we prove a variation 
of a well-known result from Diophantine approximation called Groshev's theorem (see, e.g., [23, Theorem 
1.12]). 

Define 

U^{uke [-1,1] VA;{0,1,2} : 

[uo]i = for i G {1, . . . ,ni 
[ui]i = for i e {-Ri + 1, . 
[u2]i = for i G {1, . . . ,ni 
= for i G {l,...,ni 



- no} U {rii - no + i?o + 1, • • •}, 
••}, 

- 77-2} U {ni - ^2 + -R2 + 1, • • •}, 
-i?3}U{ni + l,...}}. 
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lA is the set of real numbers such that their binary expansion, when viewed as vectors of length rii, is in 
the set U as illustrated in Fig. [T5] in Section IVII-AI Here we assume that the binary expansion of u and 
—u is identical. 

Lemma 12. Let nQ,ni,n2 G such that ni > uq > n2, and let Rq, Ri, R2, R3 G Define the event 

B{uo,Ui,U2,U3) = {(90,91,92) G (1,4]^ : 1 5-0^^0 + 5-1 (^i + W3) +5-2^^21 < 2^""^}, 

and set 

B= y B{uo,Ui,U2,U3). 

{uo,ui,U2,U3)€U\{{0,0,0,0)} 

If there exists 5 G (0, 1] satisfying 

Ri + Ro + R2 + R3<ni-Q- log(2184/5), 
Ro + R2 + R3<no-6- log(2184/5), 
R2 + Ri<n2- 6, 

then 

li{B) < S. 

Lemma [T2] is the equivalent for the Gaussian channel of Lemma [TT] for the deterministic channel. Note 
that, except of the differences in the constants, the conditions on the rates in the two lemmas are identical. 

We now prove Lemma [9] in Section IVLAI using Lemma [T2l Apply Lemma [T2l with 6/2 instead of S and 
the same rate allocations as in the deterministic case, see Figs. [8] and |9] in Section ITV-B[ Let Bm C (1,4]'^ 
be the collection of triples {9mo, 9mi, 9m2) such that decoding is successful at receiver m. Define Bm as 
the collection of channel gains (hmk) C (1,2]^^^ such that the corresponding {9mk) are in B„i- Finally, 
let B denote the union of Bi and B2. Following the same arguments as in the proof of Lemma |7] from 
Lemma [TT] presented in Section IVH-Al it can be shown that if the decoding conditions in Lemma |9] are 
satisfied, then 

fisiBm) < 6/2 

for m G {1, 2}. 

The next lemma allows us to transfer this statement about the products 9mk of channel gains to the 
corresponding statement about the original channel gains hmk- For ease of notation, define 

90 — hiihi2, 

91 — ^11^22, 

92 = hi2h2l. 

Lemma 13. Let B C (1,4]^ be a subset of channel gains {90,91,92) such that fi3{B) < 6. Define 

B ^ {{hmk) e {1,2]'-' : {90,91,92) e B}. 

Then ^^{B) < 6. 

The proof of Lemma [13] is reported in Appendix |E] 

Applying Lemma [13] to the sets Bi and B2 corresponding to the outage events defined above, this 
implies that 

liA{Bm) < 6/2. 

Hence, 

proving Lemma [9] 
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We continue with the proof of Lemma [121 Instead of directly analyzing the set B in the statement of 
Lemma [T2l it will be convenient to work with an equivalent set. Note that B{uo, Ui, U2, Us) can be written 
as 

B{uo, ui, U2, U3) = {|(7o2"^^io + (?i2"H^i + ^s) + 922'''u2\ < 2^}, 

where we assume throughout that all sets are defined over {go,gi,g2) G (1,4]^. By the definition of U, 
we can decompose 

2^^m = A[q,, 

2"^M2 = ^2^2, 

2"^% = gs, 



Qk £ {~<5fe) —Qk + 1, • • • , Qfc — 1, Qk] 



with 

for e {0, 1,2} and 
for k G {0, 1, 2, 3}, where 

Qk = 2^K 

Using this, we can further rewrite 5(^0, mi, M2, M3) using the triangle inequality as 

fi(uO,Ui,M2,?^3) = {l^ofi-ogo + ^'ifl-igi +fi'ig3 + ^2fl'2g2| < 2^} 

^ {\A'm^ + A^g^q^ + A'2f72g2| < 2^ + 2^'+^} 
^ {\Aogoqo + A[giqi + ^'2^292 1 < P'} 

= 5'(go,gi,g2), 



where we have defined 



Setting 



(3' = 2^'- 



B'^ [j B\qo,qi,q2), 



'?0i9li'?2G^: 
(90,91,92)7^0, 
|9*l<'3feVfc 



we then have 



l^siB) < /i3(i?')- 
The next lemma analyzes the set B' with A'q = 1. 

Lemma 14. Let (3 G (0, 1], Ai,A2 G IN, and Qo, Qi, Q2 G IN. Define the event 

B'iqo,quq2) = {igo,gi,g2) e (1,4]^ : {g^qo + Aigiqi + A2g2q2\ < /?}, 

and set 

B'= U i?'(go,gi,g2). 

90,9l,92eZ: 
(90,91,92)7^0, 
l9ftl<QfcVfc 
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Then 



with 



< 168/3(2min{g„|^} +min{Q,g2,^,^} 

+ 2 mm |Qi, - | + mm jg^Qi, ^ | J 

A . max{Qo,^2Q2}\ 
Qi = mm |Qi, 8 1, 



Q2 = mm 1^2, 8 1. 



,max{(5o, ^iQi} 

Remark: The special case of Lemma [14] with Ai = A2 = 1, Qo = Qi = (52 = and Q ^ 00 
corresponds to the (converse part of) Groshev's theorem, see, e.g., [l 23l Theorem 1.12]. Hence, Lemma [141 
extends Groshev's theorem to asymmetric and non-asymptotic settings. 

Before we present the proof of Lemma [T4l we show how to prove Lemma [I2] with the help of Lemma [141 
Proof of Lemma [721- We consider the cases A'q < mm{A[, A'2}, A[ < mm{A'Q, A'2}, and A'2 < 
mm{A'Q, A[}, separately. 

Assume first that A'q < min{y4'^, Aj}. Define 

^ = 1, 

Ai = A'^/A'q = 2^''-fii-"o+"i 

A2 = A'JA'q = 2^0-i?2-no+n2^ 
/3 A l3'/A'o = 2^0+R3-no+6^ 

Note that Ai, G IN, and that /3 e (0, 1] if 

Ro + R3<no- 6, (63) 

as required by the assumption in Lemma [Ml The quantities Qi and Q2 in Lemma [Ml can be upper 
bounded as 

Qi < 8max{go,^2<32}Mi = 8Q0/A1, 
since uq > n2 implies that Qq > A2Q2, and 

Q2 < Q2. 

Applying Lemma [Ml yields then that 

KB) < KB') 

<168^(2Q2 + ^ + 2^^+Q2Qr) 
< 168/3 (2Q2 + Ml + + 8^) 

^2*52 Qo Q0Q2 

Ai 'A^' , 
Q0Q2 
~Ar 



< 2 184/3 max I Q2, 
= 2184/3 max I Q2, 
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where we have used that Q2 > I and that A2Q2 < Qo implying 

A2Q2 ^ Q0Q2 

Substituting the definitions of /3, A^, and Qk, yields that 

fi{B) < 2184 •2^«+^3-"o+6 max {2^%2^i+-f^2+"«-"i}. 

Together with (l63l) . this shows that if 

i?i + i?o + i?2 + i?3 < ^1 - 6 - log(2184/5), 
i?o + ^2 + < ^0 - 6 - log(2 184/5), 
-Ro + -R3 < ^0 - 6, 

then 

fi{B) < 6. 

Since S E (0, 1] and R2 > 0, the third condition is redundant and can be removed, showing the result in 
Lemma [T2l We point out that the third condition in Lemma [T2l is not active if Aq < mm{A[, A2}. This 
is consistent with it not appearing in the derivation here. 
Assume next that A[ < minjAQ, A2}. Define 

Ao ^ A'o/A[ = 2-^o+^i+'^o-'^i, 

Ai = l, 

A2 = A'JA[ = 2^i-^2-ni+n2^ 
(3 4 (3'/A[ = 2^i+^3-ni+6_ 

Note that Aq, A2 G IN, and that f3 e (0, 1] if 

i?i + i?3 < m - 6. (64) 

We can hence apply Lemma [14] by appropriately relabeling indices (i.e., by swapping indices and 1). 
The quantities Qo and Q2 can be upper bounded as 

,max{Qi, ^2^2} 

and 

Q2 = mm jQa, 8 1 < Q2. 

Applying Lemma [M] yields then that 

/.(i?) < /i(i?') 

< 168/5 (2Q2 + Q0Q2 + 2Qo + Q2Q0 

< 168/3 (2Q2 + Q0Q2 + 2Qo + Q2Q0 

< 1008/3goQ2. 
Substituting the definitions of /3, A^, and Qk, yields that 

/i(5) < 1008 • 2^0+^i+-f^2+53+6-ni_ 



Qo = mm |go, 8 -j 1 < Qo, 
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Together with (|64l ). this shows that if 

Ri + Ro + R2 + R3<ni-Q- log(1008/5), 
Ri + R3<ni- 6, 

then 

fi{B) < 6. 

Since 5 E (0,1] and Rq, R2 > 0, the second condition is redundant and can be removed, showing the 
result in Lemma [T2l As can be verified, the second and third conditions in Lemma [T2l are not active when 
A[ < min{ylQ, A2}, consistent with them not appearing in the derivation here. 
Finally, assume that A'^ < mm{AQ, A[}. Define 

Ao ^ A'JA'2 = 2-^0+i?2+no-n2^ 

Ai = A'i/A2 = 2^^'^^^'^^^^~^^ 

A2^l, 

Note that Aq, Ai e IN, and that f3 e (0, 1] if 

i?2 + i?3 < ^2 - 6. (65) 

We can hence apply Lemma [141 by relabeling indices as before (this time by swapping indices and 2). 
The quantities Qo and Qi can be upper bounded as 

,max{(52, AiQi} ■ 



Qo = mm 1^0, 8 1 < Qo 



and 

,max{Q2,^o<5o} 



Qi = min 8 



Ai 

max{Q2,AoQo} 
A, 

A, 



since no > ^2 implies AqQq > Q2. 
Applying Lemma [14] yields then that 



.Q2 , Q2Q0 , c^Q2 , Q2Q 



<168/3f2^ + ^ + 2, , , 
V Ao Ai Ai Aq 

<168/3f2^ + ^ + 2^ + °^^^° 
~ V Ai 

< 2184/3 max 



Ai 



Q2 Q2Q0 



Ao A, 

Substituting the definitions of (3, A^, and Qk, yields that 

< 2184-2^2+R3+6-n2j^^|2Ro+n2-no^2^0+-^i+"2-nij_ 



43 



Together with (|65l ). this shows that if 

Ri + Ro + R2 + R3<n,-Q- log(2 184/5), 
Ro + R2 + R3<no-Q- log(2 184/5), 
R2 + R3<n2- 6, 

then 

KB) < 6, 

showing the result in Lemma [T2l It can be verified that, unlike in the other two cases, all three conditions 
in Lemma [T2l can be active when A'2 < rmn{A'Q,A[}. This is again consistent with the derivation here. 
This proves Lemma [T2l ■ 
It remains to prove Lemma [l4l The proof builds on an argument in [|24ll . 
Proof of Lemma \T4\- Define 



kol<Qo 



B'iqi,q2)= [j B'{qo,quq2) 

for (gi,g2) 7^ (0,0), and 



i?'(0,0)^ U 5'(go,gi,g2). 

<?oez\{o}: 
'?ol<Qo 

For 5(0 G (1,4], set 

B'Jqi,q2) = {{91,92) G (1,4]^ : {90,91,92) G B'{q,,q2)}. 
Observe that B'g^{qi, ^2) is a subset of and that 

f4 



fi3{B'{qi,q2)) = / fX2{Bg^^{qi,q2))dgo. 

■/ on=l 

it the cases Ai\qi\ < 

q2 ^ 0. If 



90=1 

We treat the cases Ai\qi\ < ^21^2 1 and Ai\qi\ > A2\q2\ separately. Assume first Ai\qi\ < A2\q2\ and 



A2\q2\ > 8max{Qo,^Qi} + 1, 

then 

l^fo^o + Aigiqi + A292q2\ > ^2fl'2|'?2| - ^ifl-ilgil - 9o\qo\ 

> A2\q2\ -4:AiQi-4:Qo 

> 1 

where we have used that /3 < 1. Hence, ^2{Bg^{qi, q2)) = 0. We can therefore assume without loss of 
generality that 

^2|'?2| < 8max{Qo, ^iQi}- 
By a similar argument, we can assume that 

^2|g2| < 4go 

whenever qi = 0. The set B'g^{qi,q2) consists of at most 

min{3Qo, 7A2|g2|} 
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Strips of slope —Aiqi/{A2q2) and width 2/3/(^42 |g2 1) in the g2 direction, including several partial strips 
(see Fig. [l7]). The area of this set is at most 

A2\q2\ 



/^2(5go(^i'^2)) < ^ i„ I min{3go,7A2|g2|} 



<14/3min<j-^,l^. (66) 



(1,4) (4,4) 



92 




(1,1) 91 (4,1) 



Fig. 17. Illustration of the set B'i{l, —2) C (1, 4]^ with go = 1, gi = 1, q2 = —2, Ai = A2 = 1, and /3 = 0.2. In the figure, we assume 
that Qo ^ yl2|g2|. The set consists of 10 < 7A2\q2\ ~ 14 strips of slope 1/2 = —^191/(^252). 



We now consider the case > v42|g2| and qi ^ 0. As before, we can assume without loss of 

generality that 

Ai\qi\ < 8max{(5o, ^2(52} 

for all q2, and that 

< 4Qo 



/X2(i?;,(gi,g2)) < 14/3min|-^,l}. (67) 



whenever q2 = 0. By the same analysis as in the last paragraph, we obtain that 

Qo 

■Ai\qi\ 

Finally, when (91,^2) = and go 7^ 0, then go\qo\ > 1 > /3, and hence 

/Z2(5;(0,0))=0. (68) 

We can upper bound 

B'{qo,qi,q2) 
\ l9i|<Qi \g2\<Q2 (qo,'?i ,92)7^0 / 

< E E -"3(5'(gi,g2)) 



'7l|<Ql \Q2\<Q2 

f4 



(leZ: gaeZ: >^90=1 
l|<Ql k2|<Q2 

5^ /" fi2iB'JO,q2))dg, 



g2eZ\{0}:|g2|<Q2 
A2|g2|<4Qo 



^r^^ r^i.i^ 1^^ , ^r^x rol l. 1^^ "'90 = 1 



'?2eZ\{0}:|g2|<Q2 gieZ\{0}:|gi|<Qi 
A2|g2|<8max{Qo,AiQi} Ai\qi\<A2\q2\ 
f4 



+ 5^ f _ f,2{B'JquO))dgo 

giGZ\{0}:|9i|<Qi-^90-l 
Aij5i|<4Qo 

r-4 



qiGZ\{0}:\qi\<Ql <72eZ\{0}:|<?2|<Q2 = ^ 
Ai|5i|<8max{Qo,A2Q2} ^2|g2|<Ai|gi| 

Combined with ([661), (EI]), and ([68]), this yields 



<72eZ\{0}:|g2|<Q2 
^2|'22|<4Qo 



<72eZ\{0}:|<?2|<Q2 gieZ\{0}:|gi|<Qi 
A2|g2|<8max{Qo,^iQi} Ai\qi\<A2\q2\ 

Qo 



giGZ\{0}:|5i|<Qi 
Ai|gi|<4Qo 

+ E E ^^-ii^.i}- 



gieZ\{0};|gi|<Qi g2eZ\{0}:|g2|<Q2 
Ai \qi\<8 max{Qo,^2Q2} A2 |g2 1 <^i ki I 

We now upper bound the four terms in the right-hand side of (|69l ). 
For the first term, observe that 



\{q2eZ\{0}: Igal <Q2,A2|g2| <4Qo}| <2min|Q2,4|^} 

< 8min|Q2, 



so that 

\^ A9.Rm\n I 

A2\q2\ 



g2eZ\{0}:|g2|<Q2 
^2k2|<4Qo 

For the second term in (|69l) . observe that 



^ ^2;3mm{-^.l}< 336^mm{Q..|£}. 

[0}:k2|<r 

^2k2|<4Qo 



\{q, G Z \ {0} : |gi| < Qi,AM < A^lgsl}! < 2min{Qi, 



and hence 



giGZ\{0}:kl|<Q 

-4iki|<-42k2| 



^ o in *5o ^2|g2|\ 
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Moreover, 

{q2 G Z\{0} : Igal <Q2,A2|g2| < 8 max{Qo, ^iQi}} = {q2 G Z \ {0} : Igal < Q2} 

with 

Q2 = mm 1^2, 8 1. 

Using these two facts, we can upper bound 

Qo 



g2eZ\{0}:|g2|<Q2 <?ieZ\{0}:|gi|<Qi 
A2lg2|<8max{Qo,AiQi} Ai|gi|<A2|g2| 



g2eZ\{0}:|g2|<Q2 



<?2eZ\{0}:|g2|<Q2 

< 168/3 min{Q,Q2,^,^}. (71) 



Similarly, for the third term in (1691 ). 



5^ 42/3min{-^,l}< 336/3 min{Qi,^}, (72) 



gieZ\{0}:|gi|<Q 
^iki|<4Qo 



and for the fourth term 



Y: E 42/3min{-^,l}< 168/3 min{Q2Qi,^,^} (73) 



giGZ\{0}:|gi|<Qi g2eZ\{0}:|g2|<Q2 
Ai|gi|<8max{Qo,A2Q2} A2\q2\<Ai\qi\ 



with the definition 

/=) ^ • /ri max{go,^2<52}\ 
Qi = mm 8 1. 



Substituting dVOj-dTS]) into (1691) yields 



f,s{B') < /3(^336min {q^, f^} + 168 min {q^Q„ ^} 

+ 336 min {q„ + 168 min {q^Qi, ^}) : 



completing the proof. 



VIII. Conclusion 

In this paper, we derived a constant-gap capacity approximation for the Gaussian X-channel. This 
derivation was aided by a novel deterministic channel model used to approximate the Gaussian channel. In 
the proposed deterministic channel model, the actions of the channel are described by a lower-triangular 
Toeplitz matrix with coefficients determined by the bits in the binary expansion of the corresponding 
channel gain in the original Gaussian problem. This is in contrast to traditional deterministic models, in 
which the actions of the channel are only dependent on the single most-important bit of the channel gain 
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in the original Gaussian problem. Preserving this dependence on the fine structure of the Gaussian channel 
gains turned out to be crucial to approximate the Gaussian X-channel by a deterministic channel model. 

Throughout this paper, we were only interested in obtaining a constant-gap capacity approximation. 
Less emphasis was placed on the actual value of that constant. For a meaningful capacity approximation 
at smaller values of SNR, this constant needs to be optimized. More sophisticated lattice codes (as opposed 
to the ones over the simple integer lattice used in this paper) could be employed for this purpose [|25l . 
Furthermore, all the results in this paper were derived for all channel gains outside an arbitrarily small 
"outage" set. Analyzing the behavior of capacity for channel gains that are inside this outage set is hence 
of interest. An approach similar to the one in [|26| could perhaps be utilized to this end. 

Finally, the analysis in this paper focused on the Gaussian X-channel as the simplest communication 
network in which interference alignment seems necessary. The hope is that the tools developed in this 
paper can be used to help with the analysis of more general networks requiring interference alignment. 
Ultimately, the goal should be to move from degrees-of-freedom capacity approximations to stronger 
constant-gap capacity approximations. 

Appendix A 
Verification of Decoding Conditions 

This appendix verifies that the rate allocation in Section IV-AI for the deterministic X-channel satisfies 
the decoding conditions (fT9l) and (|20|) in Lemma |7l 
Case I {ni2 + ^21 < '^ii): Recall 

-Kii = nil - n2i, 

dP a 

K22 = ^22 — "-12, 

pCApCAp Ap An 
ri22 — -^11 — -K12 — -K21 — U. 

This choice of rates satisfies (|19al ) and (|20a|) . Since these are the only two relevant conditions in this case, 
this shows that both receivers can recover the desired messages. 
Case II (nil < "'^12 + ^21 < "'^22): Recall 

dP a 

Rii = nil - n2i, 

pP A 

K22 — '"■22 — ni2, 

R22 — ^12 — Rll, 

pC A p A p An 
-n-ii — K12 — -fX2l — U. 

At receiver one, (|19al ) and (|19b|) are satisfied since 

R22 + Ru = ni2 < nil. 

Condition (|19cl) needs not to be checked here. At receiver two, (|20al ) is satisfied since 

R22 + -R22 = ''^22 + n2i - nil < n22- 

Conditions (I20bl) and (I20cl) need not to be checked here. Hence both receivers can decode successfully. 
Case III (^22 < ni2 + "21 < nn + |n22): Recall 

dP a 

-Rii = nil - n2i, 

dP a 

^22 = ^^22 - ni2 



22 ~ "-22 "-12, 

R12 = ("12 + 2n2i - nil -n22) - , 
R21 = ("21 + 2ni2 - nil - "22 

Rii — "21 — -R22 ~ -^21, 
R22 — "12 — Rii — Ri2- 
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To check the decoding conditions (fT9l) and (|20l) . we first argue that 

max{^2i , ^22 } = ^22 > C74a) 
max{i?i2,i?fi} = (74b) 

The first equality trivially holds if R21 = 0. Assuming then that ^21 > 0, we have 

-^22 ~ -^21 = ""-22 — — Rl2- 

If R12 = 0, then this is nonnegative. Assuming then R12 > 0, we obtain 

^22 - R21 = 2^22 + rill - 2(ni2 + ^21) > 0, 

where we have used that n22 > fiu and that riu + \n22 > nu + n2i. This proves (I74al) . Using a similar 
argument, it can be shown that Rii — R12 > 0, proving (|74bl) . To check the decoding conditions (fT9l ) at 
receiver one, observe now that 

^22 + -^12 + -Rll = rii2, 

satisfying (I19bl) . Moreover, 

R^i + -R22 + -^12 + -Rii = -Rn + ^12 

= 2^12 + n2i - n22 - R21 
< nil 

satisfying (|19al ). Finally, if R12 > 0, then 

R12 + Rii = ni2 + n2i - n22, 

satisfying (|19cl) . and if _Ri2 = 0, (I19cl) is irrelevant. Using a similar argument, it can be shown that the 
decoding conditions (|20l ) at receiver two hold. 
Case IV (rill + \n22 < ^12 + ^21 < |?t-22): Recall 

-Kli = nil - ^21, 
dP a 

-f<22 — ""-22 — ■'^12, 

-R21 = [ni2 - \n22\ , 

R12 = Rii = [n2i — 1^22] , 

-R22 = ^^22 - ^21- 

To check the decoding conditions, note first that 

max{^2i, -R22} = -^22) 
max{i?i2,i??i} = i??i, 

since 

1^22 > ni2 + n2i 
by assumption. For receiver one, we then have 

i?ll + -R22 + -^12 + Rn ^ "^11; 

satisfying (|19al) . Moreover, 

^22 + -R12 + -Rii < nil + |'^22 - n2i < ni2, 
where we have used riu + \n22 < ni2 + ^21. Hence (|19b|) is satisfied. Finally, 

R12 + -Rii < nil - hn22 < ni2 + ^21 - ^22, 
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where we have again used riu + \n22 < ^12 + ^21- Hence (|19cl ) is satisfied. Together, this shows that 
decoding is successful at receiver one. At receiver two, we have 



satisfying (|20al) . and 
satisfying (I20bl) . Finally, 



i?22 + -^n + -^21 + -R22 — ''^22; 



i?^]^ + -R21 + -R22 — ^21, 



-R2I + -R22 < |'^22 < ^112 + ^21 - nil, 



where we have used nu + 1^22 < ni2 + n2i. Hence (I20cl) is satisfied. Together, this shows that decoding 
is successful at receiver two. 

Case V (|n22 < ni2 + ^21 < rin + 77.22): Recall 

-Rii = nil - n2i, 

-r<22 — '"■22 — ni2, 

Rl2 = -Rn — |_i^21 — |'^12j ! 

-R2I — -R22 — |_i^l2 — |'^2lJ • 

For decoding at receiver one, we need to verify the decoding conditions (fT9l ). We have 

-^n + -^22 + -^12 + Rii < nil, 

satisfying (|19al ). Moreover, 

^22 + -R12 + -Rn < '^ll - 1(2^21 - ^12) < 7712, 

where we have used (l27l) . This satisfies (|19b| ). Finally 

-R12 + -Rn < nil - |(ni2 + ^21) 

= 77ii + 77i2 + 7721 " |(^12 + ^2l) 

< 77ii + 77i2 + 7721 - 27722 

< 7712 + 7721 - 7722, 

where we have used that 7712 + 7721 > §7722. Hence (I19cl) is satisfied. A similar argument using (l26l) instead 
of (ITTI) . shows that the decoding conditions (t20l) at receiver two hold. Hence decoding is successful at 
both receivers. 

Appendix B 
Proof of Lemma [8] in Section EE 

Throughout this proof, we make use of the fact that, for the (modulated) deterministic X-channel (fTTI) . 
the definition of capacity imposes that 

uZl = iu^k[t])J=i 

is only a function of Wmk- 

We start with (|29al ). Define S12 as the contribution of the second transmitter at the first receiver, i.e., 

.-12 ^ G12 ( _°c ) © Gio ( _| 
Let S22 denote the contribution of the second transmitter at the second receiver, i.e., 

S22 = G22'U22 © G20'"l2- 
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Similarly, we define sn and S21 as the contributions of the first transmitter at the first and second receivers, 
respectively. With this, we can rewrite the received vector at receiver m as 

For block length T, we have, 

T{R22-s) < l{w22;yP) 

< I{w22; yf\ ^in\ uf^\ W12) 

= I{w22\y?\s''y2^ I uf^\'il2i\wi2) 
= /(W22; S22 >Si2 I W12) 

= I{W22\ 4? I ^12) + I{W22\ 4? I SiP, W12) 

< H{s^S I w^2)+H{sg^ I si?,«;i2), (75) 
where the first step follows from Fano's inequality. In addition, using again Fano's inequality, 

T{R^^ + R^2 - e) < l{w^i,w^2;yD 

< /(wii,wi2,W2i;^r^) 

= H{yP) - H{yP \ 1^11,^12,^21) 

= H{yP)-H{sg^ \wu). (76) 

Adding (|75]) and (|76l) yields 

T(i?ii + i?i2 + R22 - 2e) < H{yP) + | sg\ 10,2). 

For the first term on the right-hand side, we have 

H{y''p) < Tmax{nii,ni2}. 
For the second term, recall that u\2 is a function of only W12, and hence 



<H[G22U',, 



G I 



Since Gmk is lower triangular with nonzero diagonal, it is invertible, implying that 

H{sg^\sg\w,2)<H{ug^\iugY^^) 

< {n22 - ni2y. 

Together, this shows that 

T(i?ii + i?i2 + i?22 - 2£) < Tmax{riii,ni2} + /f((w^2)^^^ | (^22)^^^) (77) 

< T(max{?2ii, ^12} + (^22 - 7^12)^)- 

Therefore, as T — )■ 00 and e — )• 0, we have (|29al) . Similarly, we can prove (I29bl) . (|29cl) . and (I29dl) . 
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We now establish the upper bound (|29el) . Starting with Fano's inequality, 

T(i?n + Ri2 -e)< l{wu, wu, vP) 



< /(wii, W12; yP, 4?, ^^2i) 



I[wu,wi2;yl I W21) 



/(wii, W12; I W21) + I{wu,wi2;yi^ \ s''2i,W2i) 



I AT) 



< H[sf^^ I W21) +H{y[^' I s^{\w2i) - H{y\^^ \ s^{\ W21, u^n, 



.(T) I ^(T) 



(T) I ^(T) 



'21 

'21 



«;2i)+i^(yr 
«;2i)+i^(yr 



521^^21) -H{y[^' I s^{\w2i,wii,wi2,wi{\ 



S2?>^2l) - ^(sl2^ I ^^12) 



Similarly, we have 

T(_R2i + -R22 
Adding (|78]) and (|79l), we obtain 

T{Ru + R12 + R21 + R22 - 2e) < H{yP 



.(T) I AT) 



(T) -(T)^ 



(T) 
12 



e) < H{sf^ I w,2) + H{yf^ I ^12) - (4T^ I ^21) 



(r) I AT) 



-ST) 
'21 



(78) 
(79) 



521^^21) +H{y^^^ I si2\wi2) 



JT) I -(T) 



14?, 4?)+ ^(^rM-- 



r.C^) I JT) (T) 
12 5 "'12 



where in the last line we have used that u'^l is only a function of Wmk- For the first term, we obtain 
using invertibility of the matrices Gmk, 



H{y[ 



,-.(T) I AT) „AT) 



'21 ' '"21 

H{yf' I 



iiJ 



21 



<H[G^ 



o(^) 



12 



9A(t) 



u 



12) 



G 



10 



Q(T) 



22; 



C ^{T) 
III 



G 



12 



10 



'U^22)^^' 



Since the matrices Gmfc are lower triangular, this last term is upper bounded by 

Tmax{ni2, (nn - n2i)^}. 

By an analogous argument, 

H{yf^ I ^Ss^ttSP) < Tmax{n2i, (n22 - ^12)^}- 

Together, this shows that 

T{Rii + R12 + -R21 + -R22 - 2e) < r(max{ni2, (rin - ^21)"''} + max{n2i, (n22 - ^12)^}), 

proving (|29e| ) as T — > 00 and e — > 0. Similarly, we can prove (|29f|) . 
We now establish the bound ( |29gp . By Fano's inequality, 

T(i?21 + i?22 -S)< I{W21, W22; yP) 

r,F) (.-.CAT) 



< l{w21,W22;y2 % (^22)^^S^12) 

= /(w2i,w;22;^r^ ('"22)^^^ I ^12) 

= J(l(721, W22; (*i22)^^^ I ^12) + l{w21,W22;y^P \ Wl2, ('"22) ^^^) 
= H{iugY^^)+l{w2uW22rsP;yP I «^12,(*^^2)^^^) 



-^(^21 ;?/2 I ^12,^21,^22,(^22)^^0 

H{iugY^^)+H{yP I w,2,{ugY^^)-H{{u^,Y^^). 



(80) 
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Moreover, using again Fano's inequality, 

T{Ru-e)<l{wu;y^P) 

= l{wii;yP I 1^12,^21,^22) 

= I ^12,^21,^^22) 

= H{uP) 

= i7((nf,)(^))+i7(K)(^)|(^?,)(^)). (81) 

Adding §^ and dSB yields 

T(i?ii + i?2i + i?22 - 2e) 

< H{yP I w,2,iugr')+H{iu[,r^ I in^,p^^)+H{iugr^) 

< T( max{n2i, (n22 - ^12) + } + (un - n^i^) + H{{ugY^^). (82) 
Combined with ([77]) derived earlier, we obtain 

T{2Ru + R12 + R21 + 2R22 - 4£) 

< T(max{nii,ni2} + max{?T,2i, (?t.22 - ^12)^} + ('^ii - "^21)^) 
+ i^((^^^,)(^))+f^(«,)(^)|(^.^^)m) 

= T(max{raii,rai2} + max{n2i, (^22 - '^12)'*'} + ("-ii - '^21)'*") + if ((1*22)^'^^) • 
Since (^22)*^^^ is a deterministic function of u'22, we have 

^((^22)^^^) <i^(w^22)=Ti?22. (83) 

From (l83l) . we obtain 

T(2^ii + ^12 + -R21 + -R22 - 4£:) < r(max{nii,?2i2} + max{n2i, (^22 - '^12)'^} + ("n-ii - ''^2i)'^)- 
Letting T — )■ 00 and e — )■ yields the upper bound ( |29gp . Similarly, we can prove (|29h|) -( [29]] ). ■ 



Remark: Equation [83] is a key step in the derivation of the outer bound (29g). If we had used the 
standard bound H(^(u22Y^^) < Tmax{r222, ^^21}^ we would have obtained a looser bound than (29g). 

Appendix C 

Analysis of Mismatched Encoders and Decoders 

The proof of Theorem [6] in Section IVI-AI assumes that the precise channel gains hmk are available at all 
encoders and decoders. Here we assume instead that these channel gains are only known approximately 
at any node in the network. As we will see, the only effect of this change in available channel state 
information is to decrease the minimum constellation distance seen at the receivers. 

Formally, assume both transmitters and receivers have only access to estimates hmk of hmk satisfying 

\hmk-hmk\<e = 2-^-^--''^->^. (84) 

In other words, all transmitters and receivers have access to a max^ ^ nm^-bit quantization of the channel 
gains. Since we know a-priori that hmk ^ (1, 2], we can assume without loss of generality that hmk G (1, 2] 
as well. 

Each transmitter k forms the modulated symbol Umk from the message Wmk- From these modulated 
signals, the channel inputs 

Xi = h22Uu + hi2U21, 
X2 = h2lUi2 + hiiU22 
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are formed. In other words, the transmitters treat the estimated channel gains hmk as if they were the 
correct ones; the encoders are thus mismatched. The modulation process from wik to Uik is the same as 
in the matched case analyzed in Section IVI-Ai Since /i^^ < 4 and m^^, < 1/16, the resulting channel 
input Xk satisfies the unit average power constraint at the transmitters. 
The channel output at receiver one is 

yi = 2""/liiXi+2"i2/ll2X2 + 2l 

= (/iii/i222""Mn + hi2h2i2''''ug) + (/ii2/iii2""m^i + huhu2''''ug) + {h2hn2''''ul^ + z^), 

As in the matched case, the received signal consists of desired signals, interference signals, and signals 
treated as noise. For the third part treated as noise, we have 

|/ll2/^ll2"^^^/2'2l < 1, (85) 

since 

2"-«2'2e [0,1/4). 
The demodulator at receiver one searches for 



Su — I Mil, 
Si2 — I Ui2, 

Sio = 2""M^i + 2"i2m^2 

that minimizes 

\yx - huh22Sn - hi2h2iSi2 - huhuSiol. 

The demodulator then declares the minimizing 2~""sii and 2^"''^^si2 as the demodulated symbols and 
discards siq. Note that the entire demodulation process depends solely on the estimated channel gains 
hmk and not on the actual channel gains hmk- Furthermore, the demodulator is the maximum- likelihood 
detector only if the estimated channel gains coincide with the actual channel gains. Thus, the demodulator 
is mismatched. 

We now analyze the probability of error of this mismatched demodulator. There are two contributions 
to this probability of error. One is due to noise, the other one due to mismatched detection. Let d' be the 
minimum distance between any two noiseless estimated received signals (as assumed by the mismatched 
demodulator using hmk)- Let d be the maximum distance between the noiseless received signal yi — Zi — 
and the estimated received signal with the same channel inputs. The probability of error of 
the demodulator is then upper bounded by 

^{uik ^ uik for k e {1,2}) < 2P(zi + \hi2hu2''''u^2\ + d > d'/2) 

<2F{zi>d'/2-d-l), (86) 

where we have used (1851) . 

We start by upper bounding the mismatch distance d. We have 

d = max |/i222""Mii(/iii - hn) + h2i2''''u^2{hi2 - hu) 

+ hu2''''u^^{hu - ki) + hn2''''ug{h2 - /ii2)| 



< 4 ■ 2 ■ 2"" ■ i ■ e 



< 2, (87) 
where we have used (f84l) . that \umk\ < 1/4, and that \hmk\ < 2. 
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We continue by lower bounding the distance d' between the estimated received signal (i.e., as assumed 
by the mismatched detector) generated by the correct (sn, S12, Siq) and by any other triple {s\^, 3^2^ s'^q). 
By the triangle inequality, 

d' = min |/iii/i22(sii - s'n) + /ii2/i2i(si2 - s'^s) + /ii2/^ii(sio - sio)| 

(sii,Si2,Slo) 
7^(*ll'*12'*lo) 

> min \hiih22{sii - 4i) + ^12/^21(512 - + hi2hii{siQ - s\q)\ -3-5/2 

(sil,Sl2,Slo) 
7^(*'ll'*12'*'lo) 

>d-8, (88) 

where d denotes the minimum distance in the matched case as analyzed in Section IVI-AI Here we have 
used that 

kll - ■5n||/lll/l22 - hiih22\ < 2'"""^|(/lii - hu){h22 - h22) + h22{hii - hu) + hii{h22 - h22)\ 

< 2""-^ ■ 5 - e 

< 5/2 

by (f84l) . and similarly for the other two terms. 
Combining ([87]) and ([88]) shows that 

2 - 2 

By (l86l) . this shows that the probability of demodulation error is upper bounded by 

2P(zi > d/2 -d-1) < 2F{zi > d/2 - 7). (89) 

More generally, consider the correct decoding region, and label the incorrect decoding regions in increasing 
order of distance to the correct one, starting from i > 1 (see Fig. [14] in Section [VI-AI) . Since the minimum 
distance between any two received signal points is at least d, and taking the mismatch between the decoding 
metrics into account, the probability of the received signal falling into the ith such region is upper bounded 
by 

F{zi > £d'/4 -d-l)< F{zi > £{d - 8)/4 - 3). (90) 

Appendix D 
Proof of Lemma [IO] in Section [VIiBJ 

The inequalities (I48al) - (l48fl) have been already proved in |[T01 Lemma 5.2, Theorem 5.3]. Here we 



present the proof for inequalities (48g)-(48j). First, we establish the bound (48g). 



Define Smk[t] as the contribution of transmitter k at receiver m, corrupted by receiver noise Zm[t], i.e., 

Smk[t] = 2"^^h,nkXk[t] + z^[t]. (91) 

For block length T, we have, 

T{R22-e) <l{w22;yP) 

^ j( (T) (T) (T) \ 

< I[W22;y2 ,S\2 \X\ ',Wi2) 

— ^[W22,y2 1^12 \ Xi ,Wi2) 

= I [W22] 8)22 , S\2' I W12) 

= I{W22] I W12) + I{W22] 4? I Si?, W12) 

= h[sg^ I ^12) - h[zP) + /i(4? I 4?, ^12) - h[zf^), (92) 



where the first step follows from Fano's inequality. Again from Fano's inequality, we have, 

T{Rii + i?i2 -e)< I{wu, W12; 2/D 

= h[yr)-h[sg^\w,2). 

Adding ^ and (|93]) yields 

Using Fano's inequality at receiver two, we have 

T(i?2i + R22 -^)< I{w2i, W22; yP) 

< I[W21,W22]y2 \S\2\W12) 
= I[W21,W22;y2 ,-Si2 I W12) 

= 1(^21,^22; I W12) + l{w2i,w22;yP I sSP,wi2) 

= /i(sSP I W12) - h{zP) + h{yP I s''^\wi2) - /i(4P | ^21)- 
Moreover, Fano's inequality at receiver one yields 

T{R,,-e)<l{wu;yP) 

< l{wn; yf \ 4T\ W12, ^21, ^22) 

= I[Wu]yl '■521 I W^12, W^21,^i'22j 

= I [wu; s\^' , S21 1^12,^21,^22) 

= /i(siP,s^P I ^12,^21,^22) -^(■Sn^'52? I w^ii,«^i2, W21, W22) 

= h{sg^\w2^)+h{s^^,^\sg\w2^)-h{zr\zP). 

Adding ([95]) and (|96l) yields 

T(i?n+i?2i + R22 - 2e) 

= h{sg^ I w,2) - h{zP) + h{yP I 4?, ^12) + M4? I 4?, ^21) - h{zP,zP) 
Adding (|97l) and (|94l) derived earlier, we obtain 

T{2Ru + R12 + R21 + 2R22 - 4£) 

< h{yP) - 2h{z^P) + /.(4? I 4?,«^i2) - h{zP) + h{sg' I w,2) 

+ h{yP I 4?, ^12) + /.(4? I sg\w2,) - h{z^P,z^P) 
= h{yT^) - 2h{zP) + h{sg\ 4? I w^2) - h{zP) 

+ h{yP I 4?,«^i2) I sg\w2,)-h{zP,zP). 

Since 

U(AT) AT) I ^ , / (T) (T)N _ ./ . (T) (T) | ^ 
"-l'522''5l2 I ^12) — (^2;;^ , ^2 ) — (,^22, S22 ' -512 |^12j 

< H{W22) 

= TR22, 
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we obtain from (|98l) that 

T{2Ru + Ri2 + R21 + R22 - 4£) 

< /.(yD - 2/.(.r)) - h{zf^) + h{yP I 4?, ^12) + I 4?, ^21) 

< h{y^p) - /.(.D + 1 4?) - h{zP) + 1 4?) - 

T T / 92n22 A 



< -log(l + 22""/i2^ + 22"i^/iy +-log(l + 22"2i/i2 ' " "22 

+ 2 1 + 2^.4^. )- 

where the last inequality follows from the fact that i.i.d. Gaussian random variables maximize conditional 
differential entropy (see, e.g., [|27l Lemma 1]). Letting T — 00 and e now proves ( |48g[ ). Inequalities 
(I48hl)-([48j|) can be proved similarly. ■ 



Remark: We point out that, as in the deterministic case, (1991 ) is a key step to the derivation of the outer 
bound for the Gaussian X-channel. 

Appendix E 
Proof of Lemma [T3l in Section [Vn-B I 

By Fubini's theorem, we have 

f2 r2 /•2 r2 



h\\ = l J hi2 = l J h2i = l J h22 = l 
2 r2 r2 /•2 



HiiB) = / / / / lBihu,hi2,h2i,h22)dh22dh2idhi2dhi 

J/ni=l J h^2 = l J ho.-[=l J h22 = l 

t^{hiihi2, huh22, hi2h2i)dh22dh2idhi2dhu 

pZ j"i j"i j"i 

< I III '^3(90,91, 92)90^^1 dg2dgidgodhu 

J h\\=l J go = l J gi = l J g2 = l 
Hi 



hii = l J hi2 = l J h2\ = l J h22 = l 
4 ri 



< [ fi3{B)dhi 



<5, 

proving the lemma. ■ 
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